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A CONFIGURATIONAL INTERPRETATION OF 
MEMORY AS RELATED TO THE LEARNING PROCESS? 


by 


Roger M. Bellows 
The Ohio State University 


I. HISTORICAL ASPECTS 


Our dispositions are so permeated and 
set by educational, social, and lancuage 
mores that the notions as to what consti- 
tutes memory are definitely grounded in past 
usage, and in obsolete custom insofar as 
an objective treatment of this problem is 
concerned. It is as a result usually dif- 
ficult for the average reader to break away 
from these bonds of social heritage. A fore- 
word may save labor by expressing purposes 
and disclaiming unwarranted inferences. A 
few pages further on the reader will find a 
more or less radical departure from the cus- 
tomary conception of this psychological 
problem. The memory enigma and kindred 
problems are formulized as growing out of 
the mere fact that systems of differenti- 
ated protoplasm are related to environments. 
The problem of memory, and of learning al- 
so, are fundamentally resultant properties, 
or supra-summative products of this rela- 
tion. 

The classical notions as to the nature 
of memory and its physiological correlate, 
out of which our common concepts of the 
process eventuated, are many and varied. 
Perhaps the earliest clear-cut theory of 
memory is the doctrine of transmigration of 
the Pythagoreans. Pythagoras believed that 
he remembered things which had occurred in 
a former state of existence. He considered 
memory a function or faculty of the soul. 





The soul was a storehouse for events which 
occurred in a former existence, Its func- 
tion could continue beyond the limits of 
the present life. His theory is quite sim 
ilar to, and anticipates, Plato's doctrine 
of reminiscence. 

According to Burnham (4) Plato sug- 
gests as a definition of memory "the power 
which the soul has of recovering when by it- 
self, some feeling which she experienced 
when in company with the body." In one 
place Plato likens the soul to a waxen tab- 
let which may be writ upon by experience. 
Socrates calls this conceptural example a 
"waxen figment." Zeno, the Stoic, is said 
to have adopted this scheme. In this view 
is the idea that our empirical experience 
plays a part and that the environment is in- 
fluential in ow psychological processes. 
For Plotinus memory is a "faculty of ani- 
mals, men, and angels.” These early ani- 
mistic writers were impressed by the fact 
that native or genetic forces somehow play 
a part in our experiences. 

The ancient thinkers were not slow to 
guess that some physical or extended sub- 
stance might be the explanation of the puz- 
zle. Diogenes of Appolonia “explained for- 
getting by stating that it was due to an 
arrest of the equal distribution of air 
throughout the body, because he noted that 
easier breathing follows the recall of what 
was forgotten" writes Burnham (4). Par- 
menides, the Eliatic, thought the problem 





1. The writer is indebted to Dr. R. D. Williams for his helpful interest and suggestive criticisms. 
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rested upon the way the light and heat and 
the dark or cold are distributed in the 
body. The physiological correlate of mem 
ory was seriously considered by the Greek 
philosophers. 

During the seventeenth century Des- 
cartes clearly severed the mind from the 
body in his psychophysical interactionism. 
In Descartes is to be found the basis of 
the present-day theories of memory. While 
memory is a function of the soul for Des- 
cartes it depends upon the extended phys- 
ical substrate for its initiation as a men- 
tal or soul act. His theory of memory is 
similar, in a somewhat gross way, to the 
doctrines of conditioning of our contempo- 
rary theories, which depend upon the mechan- 
istic postulate of preferential nerve path- 
ways. Descartes' concept of enlargement of 
the pores of the brain by use is comparable 
to our notion of the reduction of synaptic 
resistance due to a growth or electro- 
chemical change in the endings of the nerve 
processes at the synapse. Examples of such 
notions are Kapper's “law” of neurobiotax- 
is, Bok's of stimulogenous fibrillation, 
and the Helmholtzian double layer ioniza- 
tion theory. These theories all attempt to 
account for memory on the basis of the de- 
velopment of permeability of discrete nerve 
pathways. They are atomistic in as much as 
they hold that memory is a result of the 
growth of parts. 

For Descartes memory depended upon two 
things: innate disposition and impression. 
The innate faculty notion was lée-gely dis- 
carded by the advent of the English as- 
sociationism beginning with Hobbes and 
Locke. Of all the extent theories which 
impregnate thought upon this problem today 
associationism is perhaps the most virile. 
These philosophers took over the atomic 
dogma from the Newtonian physics. They as- 
sumed that the mind was made up of ele- 
ments, and that to explain the adherence 
or association of these atoms was to ex- 
plain memory. They were interested, ac- 
cordingly, in the force which held these 
mental elements (which were at first ideas, 
and later sensations and images) together. 
This force was named. It was called the 
force of association. The English systems 
of psychology, except those of such writ- 
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ers as Ward and Stout, were built around 
this mysterious “force.” Even Hume yielded 
to this type of interpretative principle in 
his concept of "gentle force." 

The historical notions of memory might 
be divided into three parts. First we have 
those which insist that memory is a manifes- 
tation of some innate psychic faculty, or 
spiritual entity. Next we have those views 
which postulate an adhesiveness of elements, 
or the atomic associationistic schema of the 
English associationists. To these two views 
we note also an extension, in contemporary 
theory, of the atomic view. This stand- 
point is based upon the physiology of neuro- 
muscular units. 

The extension of the atomic scheme 
came about as a result of Spencer's empha- 
sis upon reflex units in the nervous system, 
and was enhanced by the Sheringtonian and 
Pavlovian evidence for mechanistic condi- 
tioning. As a result of this emphasis upon 
reflex units, two present-day schemes of ex- 
plaining retention were coined. The sim 
plest of these theoretical trends is. the 
Watsonian frequency-recency doctrine. The 
other theory of "stamping in” neural bonds 
is the Bentham-to-Thorndike pleasure-pain 
doctrine. 

Watson was perhaps the most influen- 
tial exponent ot the formation of preferen- 
tial nerve pathways by repetition. He 
states, "So, instead of speaking of memory, 
the behaviorist speaks of the retention of 
a given habit in terms of how much skill has 
been retained and how much has been lost in 
the period of no practice. We do not need 
the term ‘memory’, shot through as it is 
with all kinds of philosophical and subjec- 
tive connotations" (33, p. 179). He "ex- 
plains" memory on the basis of verbal and 
manual habits which have been formed to- 
ward particular sets or pairs of objects or 
persons. The development of these habits 
of bodily and verbal responses (postured 
responses) is assumed to take place by the 
mechanistic or atomistic theory of condi- 
tioning. Due to our original acquaintance 
(his practice period) with an object or 
person, the repetition of a verbal stimu- 
lus would "re-establish the whole of the 
old habit." For Watson memory is, then, 
the retention of verbal or manual habits, 
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i.e., learning by conditioning, or the for- 
mation of preferential pathways in the 
nervous system, assuming these pathways to 
be discrete and specific in nature. 

The doctrine that pleasure is the 
prime factor of motivation of all human ac- 
tivity had its origin in the writings of 
‘Aristippus and Epicurus.; Bentham and 
J. S. Mill are its more recent exponents. 
With Spencer the same doctrine was framed 
in, and apparently fortified by, the thesis 
of differential nerve impulse conduction 
over reflex units. Spencer maintained that 
pleasantness was an accompaniment of in- 
creased nervous activity and that pain ac- 
companies decreased nervous activity; and 
that success, being pleasant, caused a 
greater permeability to subsequent diffuse 
nervous discharge at the time of the later 
trial. Thus retentivity is said to bea 
result of the pleasantness of the previous 
event. A. Bain and also J. M. Baldwin ac- 
cepted this doctrine practically as forme- 
lated by Spencer, according to Cason (6). 
The present-day vestiges of Thorndike's al- 
leged law of effect are rooted in this his- 
torical tradition. 


II, THE PROBLEM RESTATED 


(a) The role of the organism 





‘The views of memory have come more and 
more to point to the protoplasmic substrate 
as the fundamental basis of the phenomenon, 
and ever away from a spiritual or mentalis- 
tic scheme. Further, the theories of mem- 
ory have taken on a more definite aspect in 
regard to the particular physiological 
parts, or mechanisms, involved. 

A question here arises concerning the 
ultimate scientific value of this latter 
trend. Can one consider memory as result- 
ing from the activity of parts as such, or 
is memory something over and above mere 
physical stuff? Many writers appear to 
think that this is the case. Some contem- 
porary psychologists even go to the extent 
of postulating hormic and spiritual schemes 
comparable with those of the early Greek 





| p. 561). 
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thinkers, because there is an indication 
that physiological processes per se do not 
account for the phenomenon. On the other 
hand most present-day writers hold to the 
belief that the process is a result of the 
activation of nervous pathways, and traces 
in the neural system. While this stand- 
point has gained in popularity it is per- 
haps no more productive than the early ani- 
mistic dogma in lighting the way toward an 
insight into the nature of this psychologi- 
cal process. 

Memory cannot be explained upon the 
basis of anything we now know about the 
nervous system and its function. Lashley 
insists that we know little more of the ac- 
tion of the nervous impulse than did Des- 
cartes (almost three hundred years ago and 
nearly two centuries before the formulation 
of the cell doctrine) when he described the 
action of his animal spirits. 

The experimental evidence of the work 
of both Franz and Lashley indicates a lack 


| of specificity in function of neural proc- 
| esses, 
| coming more and more into its own. Lashley's 
' work indicates the vicariousness of nervous 
| tissue (20). 
| conception of limited conduction paths 

| through the gray matter can be applied even 


The organismic thesis appears to be 


"It is doubtful whether the 


in the case of the spinal reflexes" (19, 
In the same place in regard to 
Descartes! notion of enlargement of the 
pores in the brain by the passage of his 
animal spirits Lashley says, "His statement 
sounds remarkably like the reduction of syn- 


| aptic resistance by the passage of the 


nerve impulse.” In this connection Kantor 
says, "As we have had occasion to see, .... 
the neural theory advanced to support and 
make plausible the revivals and storehouses 


| has contributed nothing solid to psychologi- 


cal science" (15, p. 117). In opposition to 
the doctrine of discrete function of the 
nervous system Irwin insists that "the re- 
flex chain theory of the physiologist did 
not seem to fit the development of speci- 
ficity of activity” (13, p. 128). 





i. For a general historical exposition of the hedonistic view the reader may be referred to Lec 
European Morals, Ch. 1, and to Watson's Hedonistic Theories from Aristippus to Spencer. 


key's History of 








Se ee aoe ee 


4 JOURNAL OF EXPERIMENTAL EDUCATION 


(b) The term--"memory” 





The term memory is an abstraction. 
Memory is not a tangible entity. It results 
as a psychological process, from the inter- 
functioning of the organism with the forces 
of the environment, but it cannot be found 
within the organism or in the environment. 
It does not exist as a part of the organism 
nor does it eventuate from the mere activity 
of a part or parts of the protoplasmic sys- 
tem which is involved. 

To exemplify the thought of the above 
paragraph one might draw an analogy from the 
word “health.” Parts of the body alone can- 
not make up health because the termis al- 
so, like memory, a linguistic abstraction. 
The connotation of the word results from so- 
cial usage. Every organ of the body might 
be sound and fit to do its work, but cannot 
do so because of faulty integration. Health 
is a state of togetherness of function. It 
eventuates when all the forces which are in- 
fluential in bringing about the abstract 
property of living things--which we have 
named health--are working in harmony. Health 
is not to be found in the parts, but in the 
functioning whole together with all the 
natural forces which interact in bringing 
about this condition. 

In the abstract term "memory" is the 
connotation of a carrying over in time of a 
psychobiological configuration. One cannot 
designate the stimlus, either external or 
internal, as being of value in describing 
this carry-over in time from one trial or 
event to the next. The word stimulus is am 
biguous in connection with such a psycholog- 
ical problem as it is an artifact, another 
part which has been abstracted from a total 
psychological process. The stimulus or the 
stimulating situation cannot be considered 
as being separate from the total process. 
The line of demarcation between stimulus and 
response cannot be pointed out. Neither is 
one organ or system of parts (e.g., the 
nervous system) important by itself, nor is 
the organism of itself of sufficient impor- 
tance in a psychological process to account 
for complex psycholorical phenomena. Rather 
the organism must be considered as it is re- 
lated to the forces of the environment. Upon 
this relation the problem of memory depends. 
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(c) The nature of the psychological process 





Memory is a supra-summative property 
of living and growing protoplasm. This prop- 
erty depends upon the organismic-environmen- 
tal relation. As such it is not necessary 
to employ an innate faculty idea to de- 
scribe and explain the process. Neither is 
it desirable to adhere to a system of ele- 
mentarism or atomism in an adequate state- 
ment of the problem. 

Memory depends upon the organismic- 
environmental relation. This statement in- 
volves several interesting facts. (1) On the 
side of the organism memory depends upon a 
physiological state. The physiological cor- 
relate of memory, which is the fact of re- 
tentivity, might be defined as a particular 
degree of physiological differentiation of 
the organism as a whole. (2) Memory itself 
is defined as a residual dynamic relation, 
which is a higher level of physiological 
growth, which exists after the partial or 
complete resolution of an organismic- 
environmental tension. (3) Retentivity or 
memory is one condition for the learning 
process. Let us see what is included in 
these three statements. 

Memory may be said to depend upon a 
physiological state of degree of physiolog- 
ical differentiation of the organism as a 
whole. Without a sufficiently mture ani- 
mal there could be no memory process for a 
given situation, for tissue would not be so 
differentiated and irritable, and integra- 
tion would not be sufficient to bring about 
the organismic state of dynamic relation. In 
the older nomenclature the organism would 
not have an experience or perception. Amoe- 
bae cannot remember a sound, for example, 
because tissue has not evolved to a suf- 
ficiently high level of specialization so 
that the sound affects this organism; that 
is, it does not bring about a condition of 
dynamic relation. In the human being tis- 
sue is so evolved and irritable, and inte- 
gration is relatively so complete that many 
sounds can set up this condition of stress- 
strain. 





The more irritable the tissue of the 
organism the more diverse are its experi- 
ences. More of the energy changes of the 
environment influence the protoplasmic 
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system. A three year old child is said to 
"forget" his house number. The vestigial 
dynamic relation is not present in the lat- 
ter case, either because no such relation 
was formed due to the fact that the degree 
of physiological differentiation and inte- 
gration was not sufficient to bring about 
this state of organismic imbalance, i.e., 
"house number" was not experienced, or be- 
cause the residual dynamic relation was so 
weak as not to enable the child to recall 
the house number. 

Ability to recall in the experimental 
usage is the criterion of the psychological | 
act of remembering, so that we say no remem | 
bering has taken place if the subject can- | 
not recall the material. In regard to re- 
call as a criterion of memory, and in sup- 
port of the notion of vestigial dynamic re- 
lations as a description of memory, cases 
of unusual recall under extraordinary or- 
ganismic-environmental relations are in 
point. Warren (32) reported the case of an 
elderly man who returned to his Alma Mater 
to attend the commencement exercises. Under 
the strain and excitement of the experience 
he arose upon occasion and recalled word for 





word his freshman oration which he had de- 
livered many years before, and which he had 


not recalled since. In the writings of 
Freud and his followers are other startling 
examples of latent memory. 

The conditions of memory may be said 
to be the same as those which enable the in- 
dividual to be placed under psychobiologi- 
cal tension or undergo a condition of dy- 
namic relation. Without a state of dynamic 
relation no memory would be evidenced in 
behavior by recall or ability to recognize, 
the latter being a common criterion of mem- 
ory. To explain the "why" of memory is to 
state the conditions of the formation of 
this organismic-environmental.tension. We 
shall now examine the basic conditions of 
memory, or the stage when it is set for 
this psychological process. 

In the first place the subject, either 
animal or man, must have attained a certain 
degree of physiological maturation. This 
degree of irritability of tissue is the 
fundamental condition of memory. When we 
recall that memory is an abstraction it is 


| tal conditions. 





evident that in one sense this basic condi- 
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| tion for the process is the physiological 


correlate of the process itself. That is 
to say, by movement and the life processes 
animals grow and become differentiated in 
time. By this very growth animals are eé6n- 
abled to be influenced by more and more of 
the energy forces of the environment which 
surround their sensitized tissue. 

By virtue of differentiation and spe- 


| cialization of tissue organisms are poten- 
| tially under a strain, or have dynamic re- 
| lations, at all times. 
| potential unsatisfied needs. 


That is, they have 


This statement simply implies that ani- 
mals have more experiences when they are 
more differentiated. Animals low in the 


| phylogenetic scale, such as protozoa, are 
_ influenced by environmental forces. Amoeba 
| Proteus shows, upon observation, phototropic 


and thigmotactic behavior due to changes of 


| light intensity and pressure of the environ- 
| mental envelope of their surface membranes. 


Iron, too, rusts under certain environmen- 
But these systems are rela- 
tively simple and of low grade of differen- 
tiation when compared with the human. They 
cannot be influenced by changes of 50 double 
vibrations in the sounds made by vibrating 
tuning forks, for example. In the case of 
the human and many of the infra-human or- 
ganisms much smaller changes in the energy 
of the environmental forces which play upon 
their specialized tissue will bring about 
activity which indicates the presence of a 
strained condition of dynamic relation. Dy- 
namic relations cannot be set up without 
differentiation of tissue, and the degree of 
this physiological growth mst be adjusted 
to the relative intensity and kind of ener- 
gy change. 

There are three problems which are re- 
lated closely to that of memory, and which 
may be interpreted from the viewpoint of 
configuration, or dynamic relations. One 
of these is vicarious selection--the fact 
that one event will be remembered instead 
of another, or that one movement will be 
retained in patterned behavior while an- 
other will drop out. A second problem, 
which in part accounts for vicarious selec- 
tion, is that of motivation. The third 
deals with the relation of memory, as here 
conceived, to the learning process. 
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The problem of vicarious selection. 
Shortly after the beginning of the nine- 
teenth century Thomas Brown was concerned 
with the fact that some things would be re- 
membered when others would be forgotten. He 
sought the reasons for this fact. Finally 
he postulated additional or secondary "laws" 
of association, to account for it. He said 
that those experiences which were originally 
most intense, frequent, recent, and vivid 
would take precedence in the stream of men- 
tal states at the time of recall over those 
which were less so. He still dealt with 
the intangible, abstract "force" of associ- 
ation. In regard to vicarious selection as 
improvement of behavior, Cason and others 
have criticized frequency and recency as a 
basic explanatory factor (5, 16, 17, 18, 
34). Cason has recently contributed a rath- 
er complete summary of the defects of the 
"law of effect" based upon experimental evi- 
dence (6). 

Complex bio-social factors influence 
dynamic relations in the case of the more 
differentiated human subject. The fact 
that a subject will remember one person or 
event and not another is dependent upon the 
first or fundamental condition of memory 
treated above, and also upon motivational 
influences. These latter factors might be 
referred to as determiners of degree of 
sureness of recognition and recall. 

Motivation. In the factor of motiva- 
tion is included the fact that all organisms 
have potential needs. Adams, for example, 
states that needs “are universal properties 
of protoplasm" (1). These needs may be di- 
vided into two classes. They may be pri- 
mary biological needs, @.g., food and oxy- 
gen; or they may be secondary or social. 

The primary needs are factors influencing 
the behavior of all organisms. Due to the 
fact that the higher animals live together 
and communicate with one another, the more 
intricate societal type of need is in evi- 
dence. Cooperative life results from the 
biological fact of growth and differentia- 
tion--tissue in the instance of the higher 
organisms has evolved to such a special- 
ized sense-organ status that intercommunica- 
tion is possible. 

Examples of the social type of need are 
the following: approbation, information, 














| motivational factor. 
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health, wealth, property, power, reputation, 
the aesthetic, etc. Biological needs may 
conflict with social ones. Sex motivation 
is often contrary to social mores and cus- 
toms. Due to this conflict of needs of dif- 
ferent types is brought about a condition 
of organismic-environmental relation in 
which the primary needs are cancelled by 
social forces, or what is commonly called 
social pressure (cf. Taylor (29) on "alter- 
native response"; and Freud on sublimation). 
The different types of needs interact with 
one another. The net result of all of the 
needs in interaction is the influencing 
Motivation is a con- 


| tributing factor in the formation of dynamic 
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relations. 

Such a scheme of opposed or contradic- 
tory needs accounts for the fact that in 
cases of pathological subjects forgetting of 
obvious names and events takes place. The 
Freudians have assumed that such lapses of 
memory in these cases are caused by "re- 
pressions” due to social pressure. They, 
however, hold the unnatural “cellar” doc- 
trine of the unconscious, with their myth 
of censorship of ideas which attempt to en- 
Herbart and also 
Schopenhauer held similar notions regarding 
the suppression of ideas. This mysterious 
interpretation of the mind is unnecessary 
and lacks the virtue of parsimony when the 
"mind" might be considered as a potential 
organismic property of protoplasm which 
comes into being by reason of the fact that 
the organism is related to the environment 
in a peculiar manner. The major psychologi- 
cal processes are developmental problems. 
These processes result from the formation 
of dynamic relations. 

The relation of memory to learning. 
When such organismic tensions are resolved, 
learning takes place. Learning is a reor- 
ganization of potential, or reorganization 
of the psychobiological field, which reduces 
the dynamic relation to a minimum. To make 
this somewhat vague paragraph clear, the 
fundamental differences between memory and 
learning will be pointed out. 

The learning process is dependent upon 
degree of physiological differentiation, 
and environmental barriers to the satisfac- 
tion of needs. Differentiation is an 
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important prerequisite for learning for two 
reasons: (1) If the degree of differentia- 
tion is not sufficient, the organism cannot 
be subject to a particular organismic situ- 
ation of tension (1.e., no problem will be 
present, or no dynamic relation will be ef- 
fective). (2) If the organismic-environmen- 
tal stress is present, differentiation is 
necessary to resolve the strain or reorgan- 
ize the field. In case the relation is 
present but not partially resolved due to a 
lack of a sufficient level of physiological 
differentiation, either failure of the 
adaptive response, maladjustment to the 
situation, or death (in the instance of 
"jungle environment") ensues. Dynamic re- 
lations or psychobiological tensions are 
susceptible to experimental quantification 
and measurement. In the comparative labora- 
tory, for example, Warden has measured ten- 
sions by the obstruction method (31) in the 
case of the sex and hunger motivated albino 
rat. 

Obstructed needs set the stage for the 
learning process as they do also for the 
process of memory, provided that the physio- 
logical level is of a sufficient grade to 
meet the two conditions stated in the para- 
graph just above. The memory process, of 
which the physiological correlate is differ- 
entiation of the organism as a whole, ac- 
counts for asymptotic behavior or perfec- 
tion of the act, commonly referred to as 
selection of the adaptive patterns of ac- 
tivity and elimination of the incorrect re- 
sponses, 

At each successive dynamic relation 
the maturational level of the organism is 
on a higher plane. That is to say, organ- 
isms may grow as they are subjected to dy- 
namic relations. Psychological growth is a 
matter of residual psychobiological ten- 
sions in the organismic sense, and in this 
sense memory is synonymous with growth. 
Adams conceived of retentivity as "a uni- 
versal property of protoplasm" (1), and 


psychological growth as so conceived is like- 


wise a universal biological fact. Differ- 
entiation of the organism as a whole thus 
enables animals to acquire more contacts 

with nature, and thereby subjects them to 
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more diverse dynamic psychobiological rela- 
tions. When the white rat meets 4 new maze 
situation it is subjected to a new organis- 
mic tension--a condition of distortion of 
the psychobiological field. The first ac- 
quisition of the true response is a reor- 
ganization of this field of forces, or a 
partial resolution of the condition of dy- 
namic relation of genetic-environmental 
forces. This acquisition may be conceived 
as leaving a residuum of physiological 
forces within the integral organism which 
coalesce with one another. Learning may be 
considered as evolution of psychobiological 
tensions, as has been suggested by a number 
of Gestalt psychologists, or the process may 
be defined as perceptual evolution.+ Memory, 
or the propensity for recall, is a process 
which has been abstracted out from the more 
basic psychological learning process. The 
relation of memory and learning is as a by- 
product of an important manufactured com 
modity to the basic product itself. 








(d) The configurational viewpoint 

experimentally considered 

In human subjects there is a high cor- 
relation between speed of learning and re- 
tention. This fact has been indicated ex- 
perimentally by Norsworthy (22), Woodworth 
(36), and Pyle (26). In psychological proc- 
esses Hollingworth states “correlation rath- 
er than compensation is the developmental 
rule" (12, p. 27). This fact is evident 
when one conceives of the organism as grow- 
ing and differentiating organismically as a 
unit, rather than postulating a separate 
mental faculty for the various abilities, 
or considering the organism mechanically as 
built up out of a number of discrete physio- 
logical units or atoms. 

It has been found in the experimental 
laboratory that, if after learning and be- 
fore recall, certain types of activity are 
interpolated, this activity affects, and in 
some cases inhibits, the performance of re- 
call. Miller and Pilzecker pointed out the 
fact of retroactive inhibition at the begin- 
ning of the century (21). They attributed 
the effect of interpolated activity to its 





1. Cf. A former study by the writer dealing with a configurational conception of the learning process (3). 
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interference with the process of "neural 
set." The latter process was presumed to 
account for retention. The neural doctrine 
has since been criticised and largely aban- 
doned as a result of Lashley's work in ex- 
perimental physiology referred to above. 
But the fact of retroactive inhibition re- 
mains. It appears to fit in well with the 
viewpoint now being considered. 

If the organism be conceived as hav- 
ing attained a particular degree of differ- 
entiation which enables it to undergo dy- 
namic relations of a particular level, the 
resolutions of these tensions mst result 
in the augmented physiological differentia- 
tion in a particular direction. Activities 
which result from different dynamic rela- 
tions such as interpolated organismic-en- 
vironmental situations would naturally 
cause differentiation in an opposite direc- 
tion. The interpolated situation would 
thereby tend to inhibit resolution of ten- 
sions which would normally result from re- 
lations similar in nature to the former 
ones. 





III. SUMMARY 


"Memory"--the term is an abstract arti- 
fact which has no definite connotation. Its 
historical interpretations fall under three 
categories: 

1. Argumentative, which assumes memory 
phenomena to be manifestations of some 
mysterious psychic faculty, 

2. Atomistic associationism, which pos- 
tulates an equally mysterious "force" said 
to account for the memory process, and 

3. Physiological atomism, which in- 
sists that the psychological evidences of 
memory are due to traces or grooves left by 
experience in discrete neural parts of the 
organism, 

None of these schemes is adequate in 
an objective custom-free working conception 
of this psychological problem, 

Rather the phenomenon results from the 
fact that organisms are related in a par- 
ticular manner to environments. This fact 
entails the following considerations: 

1. Memory depends upon one fundamental 
condition. This condition is a particular 
degree of physiological differentiation of 
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the organism as a whole. This is a neces- 
sary prerequisite of the process for two 
reasons, 
a. So that the irritable organism 5 
may be subject to a state of dynamic rela- 
tion. Dynamic relations are defined as 
states of organismic-environmental tension. 6 
b. So that this dynamic relation 
may be resolved. 

2. Memory is related to the learning 7 
process as the resolution of the state of 
dynamic relation is related to the augmented 
level of physiological process which eventu- 
ates from the formation and resolution of 
dynamic relations. Its physiological cor- 
relate is increased bodily differentiation 
of the organism as a whole. ) ‘ 

3. Memory accounts for continued im- 
provement in behavior (asymptotic perform- 
ance). 4 

4, Vicariousness of recall (the fact j 
that one name or event is recalled instead 
of another) is due to the motivational fac- 
tors present at the time of the formation 
of the particular dynamic relation, inter- 
polated organismic-environmental conditions, 
and the dynamic relation existing at the 
time of recall. Motivation is considered 
as a state of obstructed need. Needs are 
potential universal properties of proto- 
plasm. They may be either primary biologi- 
cal or societal in character, or a fusion 
of tne two types. 

5. The present fund of experimental 
factual evidence concerning correlation be- 
tween learning and retention, and retroac- 
tive inhibition, is not contrary to the 
configurational scheme of the memory proc- 
ess, 
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STATISTICAL TECHNIQUES FOR THE STUDY OF INSTITUTIONAL DIFFERENCES 
by 
Wilhelm Reitz 
University of Chicago 


The results of testing programs, both 
state and city, have revealed considerable 
differences in scholastic aptitude, educa- 
tional achievement, and other qualities of 
student bodies of various institutions. Oft- 
en these differences are much greater than 
those found in the student body of a single 
institution. Henmon, in discussing the dis- 
tributions of percentile ranks on the Ohio 
State psychological test obtained for col- 
leges in Wisconsin, states: 

"We are nowadays accustomed toa 
wide range of individual differences 
in any test that may be given, but are 
scarcely prepared for such extraordi- 
nary range of group or institutional 
differences as appear here. A range 
in medians for the 18 or 20 colleges 
on a basis of declaration of intention 
to enter from 39 - 76 and in the actual 
entrance from 40 - 76, goes far beyond 
what anyone would have expected. It is 
a striking fact that the upper quartile 
point for actual entrants in six of the 
institutions does not exceed the me- 
dian in the two institutions with the 
highest selection. Again, the median 
percentile rank of actual entrants for 
the three institutions is below that 
of the lower quartile in two or three 
institutions." 

Bixler, in studying the variations in 
average scores on the American Council Psy- 
chological Examinations among the high 
schools in the Chicago area, reports:? 

"The range of the individual 
school was from 4-334, with a pos- 
sible score of 370, and the range of 





the median score of the class was from 
65.0 to 201.4. The lowest median score 
was lower than the lowest individual 
score in six schools. The median 
scores in three schools were as high 
as, or higher than, the median scores 
of the 1930 freshman class of the Uni- 
versity of Chicago." 

Similar variations, with regard to in- 
telligence, English, and achievement in 
other subjects, have been reported in the 
Pennsylvania study.® Haggerty obtained like 
results in connection with his studies on 
the revision of standards of the North Cen- 
tral Association. Placing 60 institutions 
in the order of their average intelligence 
scores Calculated from data covering a three- 
year period, he found that the institution 
ranking highest had an average median score 
of 228 and the lowest a score of 95.4 

There is no doubt that these differenc- 
es raise some very fundamental questions 
with regard to the educational policies of 
institutions either receiving students from 
or supplying students to other institutions. 
Some of the educational problems met by in- 
stitutions receiving students are briefly 
outlined below, 

Should an institution find that a cer- 
tain measure, say scholastic aptitude ex- 
pressed in percentile ranks on the national 
basis as provided for the American Council 
Psychological Examination, shows a variation 
of the means from year to year to be less 
than the variation about the means® of the 
successive years, that institution would be 
justified in assuming that the average of 
the various yearly averages is characteristic 











- V. A. C. Henmon and F. 0. Holt, A Report on the Administration of Scholastic Aptitude Tests to 54,000 High School Sen- 





iors in Wisconsin, June, 1951. 





- R. W. Bixler, "Rating the College Entrant", Journal of Higher Education, III (October, 1952), 361-65. 





5. Study of the Relations of Secondary and Higher Education in Pennsylvania, The Carnegie Foundation for the Advancement 





of Teaching in Cooperation with the Joint Commission of the Association of Pennsylvania College Presidents and the 


State Department of Public Instruction, New York, 1951. 


- M. E. Haggerty, "The Product of Higher Educational Institutions", North Central Association Quarterly, Preprint, 


Table IV. 





- The expression "variation of the means" refers to an estimate of the variance (i.e., the square of the standard devi- 
ation) derived from the yearly averages; “variation about the means" refers to an estimate of the variance derived from 


the yearly standard deviation. 








merit ow ke emerald 





12 JOURNAL OF EXPERIMENTAL EDUCATION 


of its student body. The institution could 
further assume that its internal educational 
work in the course of time has been adapted 
to students of that level of ability and 
that the curriculum, aims and objectives, 
methods of instruction, evaluation of stu- 
dents! achievement--in brief, all institu- 
tional functions--are fairly adjusted to the 
particular level of ability. On the other 
hand, should the variation of the means be 
larger than the variation about the means, 
indicating a heterogeneous student body from 
year to year, the institution could not make 
such assumptions, but must assume that the 
fluctuation in student material is met by 
proper flexibility in its educational proce- 
dures. Should the heterogeneity exceed cer- 
tain bounds, it probably would be impossible 
for the institution to function effectively. 
This might be true, for example, if the stu- 
dent body for one year were highly intelli- 
gent, having an average equal to the 75 per- 
centile, and the next year would drop to the 
25 percentile; or would for one year be co- 
educational with a fifty-fifty ratio, and 
the next year would consist only of male or 
only of female students. Unless the machin- 
ery were highly flexible and the staff able 
to adapt the internal work to the prevailing 
status of its student body, serious institu- 
tional deficiencies would result. 

A further advantage accruing from the 
knowledge of the relative variability of the 
means from year to year, and about the mean 
of any one year with regard to certain meas- 
ures, is seen in the possibility that insti- 
tutions which find the variability of the 
means to be less than that about the mean 
for any one year might be able to determine 
rather stable quantitative norms for the se- 
lection of their students. 

Thus, for example, an institution which 
finds that it has a student body varying 
during a three- or four-year period around 
the 75 percentile rank on the American Coun- 
cil Psychological Examination seems to be 
justified in using such a percentile, with a 
corresponding range, as a definite criterion 
for the admission of students. Several sim- 
tlarly determined criteria should be of con- 
siderable help to institutions in selecting 
and guiding their student bodies. 
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The value of such criteria assumes even 
greater importance in the light of the re- 
cent move of the North Central Association 
to revise its standards in judging institu- 
tional excellency. The shift from quantita- 
tive and mechanical to qualitative and ju- 
dicious standards will necessitate that an 
institution's educational achievement be 
evaluated in terms of the abilities of its 
student body, to quote a pertinent passage 
by Haggerty: 

"What we may legitimately ask of 
any institution is that it clearly ap- 
prehend its function in terms of the 
character of its student body, that it 
frankly state its purpose and its pro- 
gram, and that it adapt its procedures 
to the character of its clientele. Let 
any institution do these things, let 
it drop its pretenses of being some- 
thing that in the nature of the case 
it cannot be, let it demonstrate that 
it can perform a useful service for 
the type of youth who seek its minis- 
trations, and it will be as worthy of 
social recognition as the one which 
prides itself on the high intellectual 
level of its student group. ....-ececee 

"It is not unvarying conformity 
to a single pattern that we seek; it 
is effective institutional service to 
the widely varying abilities and in- 
terests of the young people who seek, 
through higher education, a meaningful 
preparation for the kind of life they 
desire and can achieve. What we re- 
quire is that an institution shall il- 
luminate its practices by a clear un- 
derstanding and description of the par- 
ticular functions it essays to perform 
for the particular group of students 
who choose to enter its doors. In the 
light of this principle an understand- 
ing of the intellectual characteris- 
tics and interests of each student 
group becomes of first importance to 
each institution itself and to any ac- 
crediting agency to which it is in any 
way related." 

Business and industry have for many 
years resorted to rigid statistical tech- 
niques in the determination of the quality 





1. Haggerty, op. cit., p. 9. 
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of raw materials and finished products;! but 
educational institutions, handling probably 

the biggest "business" in the country, have 
until recently made little progress in de- 

termining their procedures in the light of 

the character of their student bodies, and 

conversely, the qualities necessary for the 
achievement of their particular aims and ob- 
jectives. 

Although institutions have had ample 
data available for a number of years, ade- 
quate statistical techniques have not been 
sufficiently focused on them to render such 
data useful to studies of institutional dif- 
ferences as related to qualities of their 
student bodies. It is, therefore, the pur- 
pose of this paper to apply certain well es- 
tablished statistical techniques for the 
analysis of suitable data pertaining to the 
student body of educational institutions. 

In essence, the techniques may be ex- 
plained as furnishing an index of homogenei- 
ty. This index would determine whether or 
not students comprising the enrollment of an 
institution are so selected from year to 
year as to form a homogeneous or a hetero- 
geneous population. It would also determine 
whether or not certain groups of students 
admitted year after year from a particular 
institution are of the "same uniform run", 
having due regard, of course, to individual 
variation, and whether or not students vary 
more in a particular measure than might be 
expected through chance. 

Pertinent institutional measures will 
be presented first. On the basis of these 
data, the statistical problem will be formu- 
lated and the techniques explained. While 
there is no restriction to any particular 
measure, the percentile ranks on a national 
or state-wide basis, as, for example, fur- 
nished in the Manual of the American Council 
Psychological Examination, are used for il- 
lustration. 

The whole procedure is first applied to 
an institution which might attempt to ascer- 
tain the central tendency of a measure per= 
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taining to its student body, (e.¢c., the mean 
of intelligence), by defining certain of its 
characteristic features in the form of a 
quantitative expression which may ultimately 
serve as a standard for selective admission. 
A second example will show how an institu- 
tion might use this technique to determine a 
rating of those institutions from which its 
students are drawn. 

The following measures are presented to 
illustrate the first case, that of an insti- 
tution attempting to determine: (1) whether 
or not its student material, over a period 
of years, has been homogeneous or heterogen- 
eous with regard to intelligence as measured 
by the American Council Psychological Exam- 
ination; and (2) what has been the level of 
intelligence of these students, with a view 
to determining approximately a standard and 
a critical range for future selective admis- 
sion. 

The data in Table I were prepared in 
the following manner. The raw scores on the 
American Council Psychological Examination, 
as reported by an institution for four suc- 
cessive years, were translated into the na- 
tional percentile ranks as found in the Man- 
ual of Instruction. This process is neces- 
sary in order to unify® all distributions as 
though they had all been obtained on one 
measuring instrument. The same unification 
would have been obtained had the various 
distributions been translated by way of the 
national percentile ranks into a score dis- 
tribution of any one year; for example, into 
the first year distribution. An illustra- 
tion will make this clear. If the first, 
second, third, and fourth years represent the 
score distributions of the 1928, 1929, 1930, 
and 1931 editions respectively, and if the 
distributions of scores on the 1929, 1930, 
and 1931 editions are translated into the 
score distributions of the 1928 edition, the 
process of transforming is as follows: a 
score of 155 on the 1931 edition represents 
a national percentile rank of 55, and to a 
score of 146 on the 1928 edition corresponds 





1. E. S. Pearson, "The Analysis of Variance in Cases of Non-Normal Variation", Biometrics, XXIII (1931), pp. 114-83. 


2. These percentiles are sometimes subject to fluctuations due to the kinds of colleges reporting. 


For instance, the 


latest reports on the Psychological Examination in the Educational Record show that colleges with low median scores 
tend to discontinue to send in their reports on which the national percentile norms are based. This, of course, 


renders national norms non-comparable. 
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@ percentile rank of 55; therefore a score 
of 155 on the 1931 edition is equal to a 
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This method could be used in transforming 
the yearly national or state distribution 


























score of 146 on the 1928 edition. into one standard distribution. This is the 
TABLE I 
DISTRIBUTIONS OF INTELLIGENCE PERCENTILE RANKS OF STUDENTS 
OF INSTITUTION A OVER A PERIOD OF FOUR YEARS 
National Distributions of Percentile Ranks Total 
Percentile Se eee Distribution 
Ranks lst Year 2nd Year 3rd Year 4th Year All Years 
90-100 22 26 25 35 108 
80-89 17 15 21 14 67 
70-79 ll ll i) 7 38 
60-69 14 12 19 65 51 
50-59 6 7 12 7 32 
40-49 10 14 18 5 47 
30-39 5 7 a 7 27 
20-29 ll 6 2 21 40 
10-19 6 5 7 6 24 
1-9 3 " ae Oe 1 = aa | 
ni ) 105 104 ae > al |  (N) 440 
(m ) 62.9048 65.7692 65.4098 63.3486 _ (WM) 64.3864 © 
(34) 27.5576 25.7089 24.2292 30.0920 | (SD) 26.9484 

















The treatment of the other distributims 
would be similar. From a purely statistical 
point of view, the equating of scores of dif- 
ferent years into the score distribution of 
one year used as the base is more satisfac- 
tory than the transformation of scores into 
percentile ranks, because score distributions 
more or less approach normality in distribu- 
tion, while percentile ranks form rectangu- 
lar distributions. Since the techniques 
proposed in this study are based on normal 
distributions, the use of percentile ranks 
must be thought of as a convenient measure 
to illustrate the whole procedure, and as 
permissible, insofar as errors introduced 
will not invalidate the results. Another 
way of eliminating the variations in the 
distributions due to the individual test 
editions would have been by Hull's! methods 
of rendering comparable distributions of 


proper method if scores are desired rather 
than percentile ranks. 

After the unification of the yearly 
measures was obtained, the means and the 
standard deviations were computed for each 
year, as well as for the total of all years. 
Table I shows the following means for the 
various years: 62.9, 65.8, 65.4, and 63.3. 
For the total of all years, the means is 
64.4. The standard deviations are 27.6, 
25.7, 24.2, 30.1, and for the total, 26.9. 

To answer the question as to whether or 
not the student body has been homogeneous or 
heterogeneous in intelligence over the four- 
year period, it is necessary first to deter- 
mine whether or not the various means fluc- 
tuate more or less about the total mean than 
might be expected through chance, giving due 
regard to the observed variation among the 
individual distributions. 





different means and standard deviations. 








If the variation of the individual 








1. C. L. Hull, Aptitude Testing, p. 596 ff. New York: 





World Book Company, 1928. 
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means is within chance fluctuation, the 
yearly distributions of students can be 
thought of as homogeneous or as samples of 
the same parent population whose statistical 
characteristics are approximately represent- 
ed by those of the total distribution. The 
mean of the total distribution, therefore, 
would be representative of the four distri- 
butions. 

Two methods will be presented and dis- 
cussed by means of which the foregoing prob- 
lem can be solved. One method consists in 
computing the correlation ratio! and in 
testing its significance. This method is 
fully presented by Pearson*® in his Tables 
for Statisticians and Biometricians. The 
second method consists in a study of vari- 
ance by the method Fisher® has outlined in 
his volume, Statistical Methods for Research 
Workers. According to the first method, a 
test of significance will show if the varia- 
tion between the arrays (in the present 
study, between the various years) is signif- 
icantly greater than is to be expected, 
without correlation, from the variation with- 
in arrays. The second method demonstrates 
whether the variance of the means is not 
more than might be expected through chance 
or not more than the variance about the 
means. 

Should the test of significance be pos- 
itive by either method, the yearly samples of 
student bodies may be considered as coming 
from different parent populations, and, 
therefore, as heterogeneous. If the test 
shows negative results, the samples may be 
considered as homogeneous; i.e., as coming 
from the same parent population. In the 
latter case, the total distribution would 
give a fair picture of the present popula- 
tion from which the institution draws its 
student body. Its average and the standard 
deviation could form the base for the de- 
termination of a standard of selective admis- 
sion. Whenever the student material is het- 
erogeneous, such a standard could not be de- 
rived in the same way, at least not with the 
same reliability. 
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By the first method, the correlation 
ratio (n) is computed and tested according 
to the procedure outlined by Pearson. This 
test is analogous to testing the significance 
of the difference between two means, 1.¢6., 
by relating the difference to the standard 
error of the difference. In this case, if 
(n2) is found, the forma is: 


(1), = observed n*® = mean n° 
standard error n?” 





observed n* = the square of the 
correlation ratio 


-K-1 


2 
(2) mean n Ne 





K = number of samples or years 

N = total number of cases (stu- 
dents), all years 

standard error »* is found by 
forma: 


1 _/e(k-1)(N- K) 
(3) Fa V Nel 








The correlation ratio (yn) can be found either 
by relating the average standard deviation of 
the various arrays (in this case, years) or 
the standard deviation of the various means 
of the arrays to the standard deviation of 
the total distribution. According to the 
first computation: 


(4) n? =l- si(ny 8; ) 


NSD@ 
and according to the second: 


k 2 
(5) n* = Sunita — MD 


In these formulas, si stands for the summa- 

tion of all quantities from 1 to k, 1i.e., for 
the individual years; ny, stands for the mm- 
ber of students in the various years; s, for 

the standard deviations of the various years; 
and SD and M for the standard deviation and 

mean of the total distribution. 








1. 1 xyt y representing the group number or the number of years, and x, the particular measure which is stuiied, such 


as intelligence. 


2. K. Pearson, Tables for Statisticians and Biometricians, Part II, pp. xl and 16. 
5. R. A. Fisher, Statistical Methods for Research Workers, Chapter VII, pp. 194 ff. 
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Institution A 





In computing n* for Table I, it is 
found that 
P _ 318859.77 _ 
by (4) n* = 1 31953430 -002111 
674.55 
2 nit te 
by (5) n*# = 310634.3 " .002111 


The computation of n* by two different meth- 
ods is advisable for the purpose of accura- 
cy; the results by both methods should be 
identical to a reasonable number of decimal 
places. 

The next step in the procedure consists 
in testing of n* for its statistical signif- 
icance. The mean n° is found by formula (2). 
Applying this to the situation in Table I, 
Institution A, 





k = 4, N = 440, so that 
4-1 3 
Se « - " 
mean n 440-1 339 = 006834 


The standard error n* is found by formula 
(3). Substituting the corresponding values 
in this formula, the following is obtained: 


—— ya eee 8) 
440-1\ 440+1 





Stand. error n° 





= .005548 

Fortunately the mean and the standard error 
of n* need not be computed, but may be found 
in Table IV of Part II of Pearson's Tables 
for Statisticians and Biometricians. fThis 
table furnishes directly all means and sig- 
mas of n* for population of 51 to 1000 and 
for 3 to 20 arrays. Substituting the three 
values thus far found in (1), 
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If the hypothesis that the means differ only 
_ through chance fluctuations be true, 
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002111 - .006834 
r1 = = 
-005548 


004723 _ 


~ .005548 — 


it is immediately seen that the resulting 
ratio (,) is negative and insignificant. The 
conclusion can be drawn that the yearly av- 
erages fluctuate within chance expectation 
having due regard to the fluctuation of the | , 
individual measures about the individual av- 
erages. In other words, Institution A has 
had fairly homogeneous student bodies over 
the four-year period studied. The total av- 
erage of 64.3864 and the standard deviation 
of 26.9484 may be considered representative 
parameters of the population from which the 
yearly distributions have been drawn. 

According to Fisher's procedure of 
analysis of variance", the same conclusion 
can be reached, although the statistical 
technique is quite different. According to 





-.85 


as ff af Ge 


a | 


| Fisher, the following set-up is necessary 


for analysis: 





f 









Degrees | 
| Variance of Sum of Squares 
-_ — |Freedom | 
| From Variation) k | 2) 
| of Means :-: (Fr - “| 
(6) From Variation ba s* tn 2) 
about Means N-k n 2 i St 
cieeslhdiniadineesseemsealininesDendlaeili setvapecialls a 
N 2 
Total |w-2 |+8; (s - mu) 
RP WE, 


1 St {ny(my - u)*} 
N k-1 


1 s(n; si) 
N N-k 





and 


will be two independent estimates of the un- 
known group variance o*, The hypothesis is 
tested by examining the significance of 2, 
the natural logarithm of the difference of 
these two estimates of variance. To illus- 
trate Fisher's procedure, the data pertain- 
ing to Institution A are used. 





———— . ae wee noe) oe eee ON BT Tee 
Variance of of Mean 
| Freedom | Squares | Square | SD _ __10g_ SD_ 
From Variation of Means | 3 1.5330 | e5110 | .71484 9.6643 - 10 
From Variation about Means 436 724.6813 1.6621 1.28922 - 2540 
Total 439 726.2143 1.6542 1.28617 9.4103 - 10(z) 
or -.5897 
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Fisher's z is found to be the difference be- 
tween the natural logarithms of the two 
standard deviations. For Institution A, 
this difference is 9.4103 - 10, or -.5397. 
The computation of z can be checked by the 
relation which exists between »* and z, 
namely, 


(N= kk)? | 
10&e > (k = 1)(1 = n)* 


1 


2 


(7) z= 


When the corresponding values are substitut- 
ed in (7), the following result is obtained: 


(440 - 4)(.002111) 


(4 = 1)(0 = 002111) | 


= 9.4103 - 10 or - 5897 


This result is identical with that obtained 
by the Fisher technique. Again z may be 


tested for its significance by relating it 
to its standard error,’ which is given by 


the formula: 
1 es oe \ 
= = + = 
v 2 ny No/ 


=k-1 and n, 


(8) SE, = 


ny) =N-k 
Substituting the corresponding values in (8), 
the SE, is found to be 
$y) wae 
> ( as) 


=\/ 3 





2 «4097 
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| variation about 
| of four years is much larger than the varia- 
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-5897 
.4097 = =-1,.44 


is obtained, which signifies clearly that z 


Relating z to SE,, a ratio of - 


| is insignificant in the sense that the va- 


riance about the means is much greater than 
that of the means. With regard to Institu- 
tion A, it may again be concluded that the 
any one mean over a period 


tion of the means from year to year, i.6., 
Institution A has been populated over a pe- 
riod of four years by students of approxi- 
mately the same average intelligence. 

In the correlation ratio procedure, 
whenever the mean n* is larger than n°, 
there is no need to consider a test of sig- 
nificance of n*, as the resulting ratio (h) 
would always be negative. The same is true 
if, in Fisher's procedure, the SD resulting 
from the variation of the means is smaller 
than that resulting from the variation about 
the means. 

Before further discussing the foregoing 


‘results, the same technique will be applied 


to another institution where conditions are 


/ such that quite opposite conclusions must be 


drawn. ‘Thé pertinent data are furnished 
Table II. 


in 


TABLE II 


DISTRIBUTIONS OF INTELLIGENCE 


PERCENTILE RANKS OF STUDENTS OF 


INSTITUTION B OVER PERIOD OF FOUR YEARS 








National 
Percentile 
Ranks 2nd Year 





100 
89 
79 
69 
59 
49 
39 
29 
19 

9 


5 
10 

5 

5 
10 
12 
15 
18 
15 
17 





(ny ) 125 112 


Distributions of Percentile Ranks 


3rd Year 
- _ ——— + —_ - _ 


Total 
Distribution 
4th Year All Years 
22 
17 
12 


61 
62 
39 
46 
41 
39 
44 
42 
41 
29 


me 1 1D 


ePyvanwvN On Qr~z 


ee 


102 


- —E _ 


105 





(m ) 50.0400 39.2857 





(N) 444 





(34) 25.1595 27.0157 











61.4706 


70.6190 (M) 54.8198 
23.9015 (SD) 28.9959 

















1. Ibid., pv. 195. 


29.6267 
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On applying the correlation ratio technique, 
the following quantities are found: 


310381.831 
370985 .586 


60603.755 
370985 .586 
4-1 
444-1 


by (4) n? = 1- = .163359 


by (5) n? = = .163359 


by (2) mean »* = = 006772 


by (3) standard error n* 
— 2(4 - 1)(444 - 4) 
‘aa ni\' 











444 +1 
= .005498 
163359 - .006772 .156587 _ 
by (1) .005498 ,00s498 ~ 75-48 
The resulting critical ratio, 28.48, indi- 


cates a significant correlation between the 
arrays or, in other words, the various means 
fluctuate significantly more than might be 
expected through chance. By applying Fish- 
er's technique to the data of Table II, In- 
stitution B, and by substituting the proper 
values in (6), a Zz value of 1.67736 is ob- 
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Z== s =. 


1 

lo 
2 te (f - 1)(1 - .163359)/" 2 
28 .637515 = 1.677359 





log, 





(444 - 4)(.163359) 1 


This value is sufficiently identical with 
the foregoing value that all calculations 
may be considered trustworthy. 

The two methods clearly indicate that 
the data of Institution B show the average 
intelligence of the student body of this in- 
stitution over a four-year period to have 
varied greatly from year to year. The dif- 
ferent yearly groups cannot be thought of as 
being drawn from the same population; conse- 
quently, the total average of 54.8198 and 
the standard deviation of 28.9059 cannot be 
taken as representative of the student bod- 
ies of this institution. 

If the data presented for Institution A 
were to be used to establish a standard of 
admission, it could be seen in the average 
of all the years, 64.3864, since the present 
tests demonstrated this average to be repre- 
sentative of all four years. The variation 
of the averages with respect to the total 
average can be found by the formula: 











tained. 
Degrees Sum 
Variance of of Mean 
Freedom Squares Square sD Log, SD 
From Variation of Means 3 136 .49494 45.49831 6.74524 1.908837 
From Variation about Means 440 699.05818 1.58876 1.26046 2231477 
Total 443 835.55312 1.88612 1.37336 1.677360 (z) 




















Relating z to its standard error by formula 








(8), 
a. - 1 
SE, = 2 3° 440 s 4096 
a critical ratio of ee = 4.10 is ob- 


tained, which without doubt signifies that 
the variation of the means is statistically 
larger than that about the means. Again, 
for purposes of accuracy, the z obtained by 
Fisher's method can be checked by (7): 





si {ny(m, - u*} 


(9) standard error m = Nik = 1) 








This quality (identical with the SD from the 
variation of the means in Fisher's method) 

is found to be .7148 for Institution A. The 
question must be answered as to what extent 
the yearly averages of Institution A may de- 
viate from the total mean in order still to 
be within a range of chance fluctuation; or, 
in other words, when does the average of a 








n= 


c 
aS 
3~ 
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particular year become “off average", so 
that the college may say either the average 
ability of its student body has definitely 
improved over the previous year, or that it | 
has significantly fallen off. Such a criti- | 
cal range, within which the yearly means may | 
fluctuate, may be variously defined, such as 
+ 3 SE, or + SPE.+ For the present illus- 
tration, a value of 3 SE in any one direc- 
tion from the total mean may be considered 
as an “off mean." Since SE of the total 
mean for Institution A is .7148, the criti- 
cal range can be defined as 64.3864 + 3(.7148) 
or from 62.242 to 66.5308. Should any year- 
ly mean fall outside this range, it would 
deviate significantly from the total mean, 
which is representative of the average abil- 
ity of the student body of Institution A. 

The same inquiry may be extended and 
the question be raised as to the limits be- 
tween which any individual should fall in 
order that the institution may maintain the 
same characteristics of the student body as 
indicated by the total distribution. To 
answer this question for the present per- 
centile data, in terms of the mean of the 
total distribution plus and minus a multiple 
of the standard deviation, is unsatisfactory 
due to the peculiar characteristics of the 
standard deviation of a percentile distribu- 
tion. The best that can be done is to de- 
fine a critical range in terms of the lowest 
and higher percentile rank on the national 
basis and keep the selection of students 
within that range. The quota of students 
within any one tenth could be made to approx- 
imate the percentages found in the total 
distribution as given in Table I. In case 
raw scores are available, the critical range 
for the selection of individual students is 
simply given by the total mean plus and minus 
a certain mltiple of the standard deviation 
of the total distribution. Any student whose 
individual measure would fall outside this 
range could not be considered a good selec- 
tion for Institution A. 

While the establishment of the latter 
two critical ranges has been perfectly per- 
missible for Institution A, it is not per- 
missible for Institution B, because the 








analysis has shown that the total average is 
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not representative of the yearly averages. 
In other words, Institution B did not have a 
definite standard of scholastic ability for 
its students over the four-year period stud- 
ied. 

One way of arriving at a representative 
average for Institution B is by combining * 
the set of four means, having due regard for 
their individual standard errors and weights. 
The process is as follows: The SE's of the 
four means are computed by the formula: 


(11) standard error m, od 
y My 


The weight w, for each mean is computed by 
the formula: 


1 
(12) WwW; = ‘sd? 
i 


Each mean is then weighted by its w,, and an 
average mean for the set of four is found by 
the formula: 


34 
= t Mm, Wy 
Ss; w 
. . 
The standard error of this M ncopines 15 found 


by the formula: 


( 13 ) M combined 


1 


(14) standard error Mocabinea = Tetw 
ot Sa 


The results of these computations are given 
in Table III. 


TABLE III 


THE COMBINATION OF THE FOUR HETEROGENEOUS 
MEANS FOR INSTITUTION B 




















— — 
my sd my % | my Wy 

+++ — 

50.0400 2.2503 | 1975 | 9.8629 | 
39.2857 2.5527 | .1535 | 6.0304 
61.4706 2.9335 .1162 7.1429 
70.6190 2.3325 +1838 | 12.9798 

Total .6510 36.0360 | 











From this table, it is found that 


36.0360 
a 


M combined = 





1. J. B. Scarborough, Numerical Mathematical Analysis, p. 546. Baltimore: The John Hopkins Press, 1920. 





2. Ibid., p. 328. 
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1 
Standard error M i 
combined 6510 | 
} 
3 ——_—_—— zs . 
806846 1°2594 | 
A representative standard for Institu- | 


tion B, therefore, is the M.oopinea » OF | 
55.3548. A critical range could be set up 
by adding and subtracting 3 SE M wonbineg3 this 
range would be between 51.6366 and 59.0730. 
Should Institution B be willing to set up 
any admission standard, having due regard 
to the average ability of its student body 
over the four-year period studied, the 
Moombined *& 3 SD combineg WOULA be the best 
standard under the circumstances. This, of 
course, does not mean that any institution 
may not arbitrarily set any other standard 
and critical range for the selection of its 
student material. Such arbitrary fixation, 
however, always implies the danger that the 
institution may be out of harmony with the 
standard it may have, if it draws without 
special effort on its natural resources. In 
other words, Institution B, in the long run, 
would be better off if it chose Mcompinea 

+ 3 SDcombined @S its standard and critical 
range, rather than the standard set, perhaps 
by the fourth year. In the former case, a 
certain continuity with respect to geograph- 
ic and social-economic factors is preserved 
--factors which may not be consistent with 
any arbitrary standard. 
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Up to this point it has been illustrat- 
ed how an institution may ascertain certain 
characteristics of its student body and how 
a quantitative expression of such character- 
istics may lead to the establishment of 
definite standard and 2 critical ranze 


] 


Sse- 
The same technique, how 
ever, may also be applied by an insti ton 
attempting to evaluate another institution, 
particularly a high school, with regard to 
the characteristics of the students admitted 
The question may be raised 
school has sent to the 
college homogeneous or heterogeneous material 
with regard to a certain measure. 

To illustrate the two foregoing methods 
of analysis, two institutions are considered. 
In the first case, the high school has fur- 
nished the institution heterogeneous mate- 
rial, and in the second, homogeneous mate- 
rial. The data are given in Tables IV and Vv, 
By substituting in forma (1), as described 
on page 15, it is found that the critical ra- 
tio: 


Ti) + 
Viv 


- 22936 - . 
~0230 


SS = 11.22 


and using forma (6), the value of z is 
found to be, 2 = 1.2447. By relating z to 
SE,, -3596, a ratio of 3.46 is obtained, so 
that both methods clearly indicate the fluc- 
tuation of the means to be significantly 


TABLE IV 


DISTRIBUTIONS OF INTELLIGENCE PERCENTILE RANKS OVER PERIOD OF FIVE YEARS 





OF STUDENTS OF HIGH SCHOOL A ADMITTED TO HIGHER INSTITUTION 


















































=_— a — —— —-~ ----- —~ 
National Distribution of Percentile Ranks Total 
Percentile — ~~ +4 Distribution 
Ranks lst Year 2nd Year 3rd Year 4th Year 5th Year All Years 
————— = ee —————_+}- a 
90 - 100 8 < 4 - | 7 17 
80 - 89 4 - | r o 2 12 
70 - 79 5 1 1 3 1 11 
60 - 69 3 1 2 5 1 | 12 
50 - 59 3 3 3 4 2 15 
40 - 49 3 4 3 | 1 1 12 
30 - 39 - | 5 5 - 1 | 11 
20 - 29 - | 4 | 2 | ° 3 9 
lo- 19 - 5 | 5 | 5 1 16 
fare - 4 | - 2 - 6 
om . - +—_—__—_—__— ——--—— +4-— i oe _ 
pe {et aS Me 27 = = 20° BS: (N) 121 
(m, 75.77 $1.67 a * a 45.00 62.65 (M) 53.60_ 
(sy ) 17.30 19.05 = °° Se 25.30 28.61 (SD) 28.41 | 
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t= -reater than might be expected through mere chance. The complete method of Fisher is giv- 
n en below. 
4 
~ Degrees Sum 
Variance of of Mean 
e- Freedom Squares Square SD log. SD 
y : - — + | > | > 
‘ From Variation of Means 4 | 236.9743 59.2436 | 7.697 2.04083 
~om Variation about Means 116 570.1427 4.9150 | 2.217 | . 79615 
t t | 
re Total ; 120 | 807.1170 6.7260 | 2.593 | 1.24468 (z) 
ed q 
al § TABLE V 
is DISTRIBUTION OF INTELLIGENCE PERCENTILE RANKS OVER PERIOD OF FIVE YEARS 
i Oe OF STUDENTS OF HIGH SCHOOL B ADMITTED TO HIGHER INSTITUTION 
National Distribution of Percentile Ranks Total 
Percentile §=§=———__—__ aii —-- + Distribution 
Vv Ranks lst Year 2nd Year | 3rd Year 4th Year Sth Year All Years 
a | 90 - 100 9 8 8 6 3 34 
~ 80 - 89 7 5 4 | 5 29 
70 - 79 M z | 4 5 24 
: 60 - 69 4 4 3 3 - 14 
50 - 59 4 6 1 2 | 2 15 
} ! 
40 - 49 4 4 | - 4 4 16 
30 - 39 2 2 | 5 . | “ 9 
j | | | 
20 - 29 - 2 - | 1 - z 
~ &§ 10 - 19 1 1 - 1 | - a 
3 l- 9 1 - | + 1 | - | 2 
$5] EEE ed —— | — - 
(ni) aie a ee ee ee | ea . 2 (N) 149 
__(m) 70.00 | 66.14 78.00 67.86 | 74.04 | (u) 69.77 
~ (sy) [22.69 | 23.39 | 22.09 | 24.76 17.43 | (sp) 22.69 
a By substituting in formas (1) and (6), the following values for school B are obtained: } 
y = 20158 = .0270 
-9187 
Z= -.27497 


By relating z to its SE,, .3584, a ratio of -.77 is obtained. The complete method of 
Fisher for Institution B is as follows: 








| Degrees | Sum | a | 
Variance of of | Mean 
| Freedom Squares | Square SD loge SD 
From Variation of Means 4 8.1216 | 2.0304 | 1.425 ~25417 
From Variation about Means | 144 | 506.7561 3.5191 1.876 -62914 
a - Gontasiehinpniaammenti ———————————— Ee = = - - — 2 = ——+ = - 
Total i 148 514.8777 3.4789 1.865 -.27497 (z) 











Both tests clearly indicate that the 
variation of the means is less than the va- 
riation about the means. This signifies 
that high school B during the period studied 
has sent to the higher institution student 
material which has been homogeneous with re- 
gard to intelligence percentiles. The aver- 
age over the five years approximates the 
70th percentile rank. 

The same statement cannot be made for 
high school A, since its student material 
has been very heterogeneous. In this case, 
the college admitting the students cannot 
rely on the high school with regard to the 
ability of the students applying for admis- 
sion. Every student from high school A, 
therefore, must be considered individually 
with regard to his ability. Should his 
measure of ability fall within the critical 
range set up by the admission office, he may 
be admitted; otherwise not, especially if 
the college wishes to keep its student body 
fairly homogeneous with regard to the par- 
ticular ability. To attempt a rating of 
such a high school is not possible without 
serious limitations. On the other hand, a 
high school which has supplied fairly homo- 
geneous student material to the higher in- 
stitution can be rated with other high 
schools of the same type. How such a rating 
can be established, and how finally a series 
of constants can be worked out adjusting va- 
rious high schools for particular measures, 
such as intelligence of students, average 
marks, and students' class ranks, has been 
shown by the writer elsewhere. Such ad- 
justment methods are only worth while if the 
fact has been established that the measure 
on which the rating or adjustment scheme has 
been based is representative of a homogeneous 
set or sets of values. 

With the present statistical technique, 
it is possible to examine all high schools 
which have furnished student material to a 
higher institution, so that finally such 
high schools can be grouped into two, or 
more, categories--high schools which have 
sent fairly homogeneous material and those 
which have sent heterogeneous material to a 
higher institution over a period of years. 
Students could be admitted from the first 
Class of high schools without detailed indi- 
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vidual consideration, while students from 
the second group could not be admitted with 
the same confidence. Students from the lat- 
ter schools probably should be subjected to 
an individual analysis of their scholastic 
aptitude, their final admission being deter- 
mined by their meeting the standard and 
critical range of the selecting institution, 
The present statistical technique, how- 
ever, need not be confined to so-called 
initiatory measures, but may well be applied 
to measures of the finished product, such as 
marks, college grade points, number of quar- 
ters or semesters in residence. These tech- 
niques also may be applied to such measures 
as intelligence of students graduating from 
an institution of higher learning, the re- 


sults of which would indicate whether or not 
| an institution over a period of years grad- 


uates a homogeneous or heterogeneous product. 
Such investigations could be utilized for 
several purposes: For example, an analysis 
of initiatory intelligence of the student 
body of an institution might show that the 
institution selected homogeneous material of 
the 60th percentile rank, and that, on grad- 
uating, the intelligence of the students 
centers around the 70th percentile rank. 
From these facts, an institution could con- 
Clude that an internal selective process 
takes place to the amount of ten percentiles 
at the level at which the institution is op- 
erating. This selective process may be nat- 
ural or excessive. If a corresponding dif- 
ference on a nationwide basis were only five 
percentiles, the institution would be forced 
to conclude its internal selective process 

to be five percentiles in excess of that 
which is reasonably expected. In other 
words, the particular institution has set 
standards for its work for which it does not 
select corresponding student material. In 
terms of dollars, this might mean, under cer- 
tain circumstances, an enormous loss to the 
institutions. In terms of educational effi- 
ciency, it might indicate a waste of effort 
and probably a less satisfactory internal ad- 
justment. While there is, in all probabil- 
ity, some selective process taking place, any 
excess is a sign that the institution is not 
doing what it intends to do. An institution 
which can satisfactorily graduate a large 








1. "Forecasting Success of Students under the New Plan at the University of Chicago." Unpublished Report to the Dean of 
Students and University Examiner, May 9, 1958. See also, "A Method of Adjusting School Marks and Class Ranks to Com- 
pensate for Variations in Standards of Marking", published in the Bulletin of the American Association of Collegiate 


Registrars, (October, 1955), 1-14. 
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percentage of the students it admits is cer- 
tainly functioning more efficiently from an 
educational point of view than is the insti- 
tution which eliminates fifty to eighty per 
cent of its students before graduation. 

The foregoing techniques should be use- 
ful also in coordinating higher institutions 
within a state, or within even a larger re- 
gion. Particularly is this true when 
through need of economy, it is desirable to 
combine or eliminate institutions. By these 
techniques it should be possible to deter- 
mine the institutions having the strongest 
units of a specialized type of education, 
and certainly that institution, or unit, 
which would show the highest level of abil- 
ity and homogeneity in its finished product, 
all other considerations--such as aims and 
objectives, location, and personnel--being 
constant, would be the most desirable to re- 
tain. 

A few words may be said about the lim- 
itation of the techniques herein recommended 
for the study of institutional differences. 
One of the limitations has already been dis- 
cussed, namely, that the measures studied 
should follow the law of normal distribution. 
It was pointed out that percentile ranks do 
not follow this law and are, therefore, less 
applicable for analysis. This is not so se- 
rious, however, for E. S. Pearson has show} 
that the sampling distribution of n* is re- 
markably insensitive to changes in popula- 
tion form. This is only true, however, if 
the number of cases, n,, for each year is 
identical or practically identical. If the 
ny vary markedly from year to year, the n* 
test is only valid when the distributions 
are normal. Both techniques are less appli- 
cable, however, should the number of cases 
be too small; that is, ny + np < 20, or if 
n, and n», in the formula for the standard 
error of Zz are greatly different from one 
another, especially when n, is very small. 
In this case, the z cannot longer be related 
to its standard error; but Fisher's tables 
of z distributions* should be used for the 
interpretation. In these tables, the prob- 
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ability is given that z in either 5% or 1% 

of random samples would surpass a given quan- 
tity. Wherever the entries of the tables are 
not sufficient to cover n, and np, the S% or 
1% points value can be computed by the form 
alas: 5 


1.6449 
h-l 


5% point, and 


2.3263 
h-1 


1% point 


(15) z= 


- .7843(2- - 2) for the 
nN) Ne 


(16) z= ~ 1.235(2- - i) for the 
Nn) n 


The symbol h in these equations represents 
the harmonic mean of n, and np: 


= Mi Ne 
Nn, + Np 


h 


An illustration will serve to make the use 
of this test clear. The four z values which 
have been computed in this paper are com 
pared with the corresponding z value at the 
1% point computed by (16). The results are 
as follows: = T 


ny 





Z Value, theory] 
1% point 
calculated 


z Values, observed 





—+ 





Data, Institution A 
- .5897 3 
Institution B 

1.6774 3 
High School A 

1.2447 4 
High School B 


-.2750 4 144 


436 -6358 


Data, 
440 
Data, 
116 
Data, 




















The results reveal that by this test 
also the observed z values for Institution A 
and High School B are not significant. 

Pearson gives a similar test‘ by defin- 
ing 

_ observed n* - mean n° 
standard deviation 7? 


and asking the question as to what the value 
of x, or of n* (n* = meann*® + » standard 
deviation n»*), must be in order that the 








1. E. S. Pearson, op. cit., p. 118. 

2. Fisher, op. cit., p. 212. 

5. Ibid., p. 201. : 

4. K. Pearson, op. cit., Part II, p. xli. 
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tail cut off the distribution curve of »n* 
may be equal to +4.-ths or ths of the to- 
tal frequency. pr AL ant Ao measure the 
deviations from the mean n*, at which the 
chances of an n* occurring are respectively 
less than P,; = .0l, or Pp, = .02, supposing 
no correlation exists. When a value of n? 
gives a value of A greater than ai, then the 
existence of correlation may be reasonably 
predicted. When n* gives a value less than 
Ayo, then the existence of any correlation 
remains doubtful. The various values of Aj, 
A>» P,, and P, are here reproduced: 
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The use of the foregoing intermediate quan- 
tities and relations will reduce the labor 
of calculation an appreciable amount. 


Conclusion 


1. Two statistical methods have been 
presented by means of which an adequate study 
of institutional differences of student bod- 
ies in particular measures has been made 
possible. The first method was Pearson's 
technique of testing the significance of the 




















n* Values, observed ‘ § Theoretical Values Observed 
| « | Ty PL | Ae Pp r 
- —+—___— _ | —_——_—__ — ———— * a 
Data, Institution A 0021 4 | 440 | 3.20 | .o12 | 2.80 | .020 -.85 
Data, Institution B 1634 | 4 | 444 | 3.20 | .O12 | 2.80 | .020 28.48 
} | 
Data, High School A . 2936 5 | 1e1| 3.14 | 011 | 2.68 | .oz0 11.32 
} 
| 
Data, High School B 0158 5 149 3.14 | O11 jo 020 -.6 











Again, as this test shows, the data for In- 
stitution B and High School A indicate sig- 
nificant correlation; that is, the variation 
of the means in each case is larger than 
might be expected through mere chance. 

With regard to the computation of the 
different intermediate quantities occurring 
in the two techniques, it may be said that 
the complete calculation of standard devia- 
tions for each individual year is not neces- 
sary, since n* can be obtained from sums in- 
volving these standard deviations squared. 
The same is true with regard to the quantity, 
"sum of squares from variation of means", 
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which can easily be obtained from the rela- 
tion: 
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=< 3 (x - M)* 





correlation ratio n,,, if y, represents the 
group number or the number of years and x, 
the particular measure which is studied. The 
second method was Fisher's procedure of 
"analysis of variance." This method consists 
of the calculation and examination of the 
significance of z, the difference of the nat- 
ural logarithms of the estimate of variance 
from the variation of the means and the va- 
riance from the variation about the means. 
The two tests lead to essentially identical 
results, since z is a single-valued function 
of n*. 

2. It has been pointed out how the sta- 
tistical study of institutional differences 
may be made the basis for: (a) the estab- 
lishment of standards of admission, (b) the 
rating of high schools, (c) the coordination 
of institutions or units of an institution 
within a state or regional area, and finally 
how such a study may form a basic procedure 
in the determination of the educational ef- 
ficiency of institutions, 
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THE VARIATION OF HIGH SCHOOL SENIOR 
AND COLLEGE FRESHMAN CLASSES? 
by 
Vergil E. Lyon 











The purpose of this investigation is 
to determine the variation of the mean in- 
telligence of high school senior and college 
freshmen classes. The variation to be con- 
sidered will be between schools and between | in certain colleges. Colleges vary consid- 
successive classes in the same schools. The | erably, and guidance should be given to the 
specific questions to be answered are: What | students in this matter. Colleges should 
is the variation of thc mean intelligence / also know how consistently they are drawing 
of students graduated from different high students of a certain level of ability. 
schools? What is the variation of the mean The data? involved in this study con- 
intelligence of students admitted to differ-| sist of the results of tests administered 
ent colleges? What is the relation of the | to high school seniors in Wisconsin high 
mean intelligence of graduating classes to | schools during 1929, 1930, 1931, 1932, and 
the size of the school? To what extent do | 1933. These tests were given by the high 
classes graduated from the same school vary | school principals and scored at the Uni- 
in intelligence? To what extent do classes versity of Wisconsin.’ In 1929 the Ohio 
admitted to the same colleges vary in in- State University Psychological Test, Form 14 
telligence? was administered to 14,967 high school sen- 
The determination of the variation of iors. In 1930, 193l,and 1932, the 1930, 
high school senior classes will be of value _ 193l,and 1932 Editions of the American Coun- 
in admitting students to institutions of | cil Psychological Examination were adminis- 
higher learning. If high schools vary con- | tered to 16,329, 20,000, and 23,242 high 
siderably in the ability of their graduates, school seniors. In 1933 the Henmon-Nelson 
colleges admitting these graduates should | Test of Mental Ability, High School Exam- 
be made aware of this fact. Students com ination, Form A, was administered to 
ing from a high school with a very low mean 26,122 high school seniors. After these ex- 
intelligence could not be admitted with the | aminations were scored the scores were 
same assurance as those coming from a high transmuted into percentile ranks and re- 
school with a high mean intelligence. The corded. The fact that the results of these 
variation from year to year within the same tests are recorded as percentile scores 
school is also of significance in college makes a very serious limitation to one part 
admission. Unless high schools graduate of this study, to which reference will be 
students of a consistent quality no rating made later. 
of the high school can be made. If high Since some reduction in the number of 
schools are consistent in graduating stu- | high schools to be considered was necessary 
dents of certain qualities, colleges can only schools graduating at least thirty 
then adjust their entrance requirements to seniors in each of the five years for which 


take care of differences between high 
scnools. In the case of colleges, prospec- 
tive students and high school principals 
should know the level of ability accepted 













































1. This study has been made possible by funds from the Wisconsin Alumni Research Foundation. 

2. The writer is indebted to Professor V. A. C. Henmon for making these data available, and for many helpful suggestione. 

5. A detailed description of this testing program is given in V. A. C. Henmon and F. 0. Holt. "A Report on the Ad- 
ministration of Scholastic Aptitude Test to 84,000 High School Seniors in Wisconsin in 1929 and 1930." Bulletin of 

the University of Wisconsin, (June, 1951). 
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data were available were included in this 
study. The total number of schools in- 
volved was one hundred and eight and the 
total number of students was approximately 
60,000. 

The twenty institutions of higher 
learning in Wisconsin receive copies of 
these test results and then determine the 
mean percentile rank for their freshman 
classes. These means and the distribution 
of freshman scores constitute the data on 
which the study of the variability of col- 
lege freshman classes will be made. 

Since percentile scores on five differ- 
ent tests are to be compared it is necessary 
to study the equivalence of these scores. 
Since the percentile scores are based on 
such a large number of cases, at least 
14,967, and since the senior high school 
population probably does not vary over a 
five-year period the percentile scores are 
comparable. 

The question of the extent to which 
these different tests measure the same abil- 
ity can only be partially answered with the 
available data. The correlation between 





TABLE 





the Henmmon-Nelson Test and the American 
Council Psychological Examination; 1929, 
1930, and 1931 Editions; for 639 university 
freshmen is .73.1 This correlation when 
corrected for attenuation becomes .85. This 
correlation is sufficiently high to indi- 
cate that these gests measure the same 
ability. Further evidence on this point is 
given by the similarity of the content. 

The first step in the computation con- 
sisted in calculating the mean percentile 
score of each school for each of the five 
years. The mean percentile score for the 
total group from each school without regard 
to the year was also found. In most cases 
all computations were made on the ungrouped 
data. The means for the one hundred and 
eight high school senior classes are given 
in Tables I and III. The means for the 
twenty colleges are given in Tables II and 
IV. Inspection of these tables reveals 
some very striking figures. The means of 
the individual school means are fairly con- 
stant from year to year but the individual 
school means within any given year vary 
widely. In 1929 the range of these means 
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THE MEAN INTELLIGENCE AND VARIABILITY FOR 108 HIGH SCHOOLS 




















| Mean Mean Intelligence Percentile ] Measures of Variability 
Number | 1929 | 1930 | 1931 | 1932 | 1933 | Total | Range | A.D. | : : 
pp ph tt Ht Ht H+ 
1| se | 55.9 | 47.4 | 56.2 | 59.1 49.0 53.4 | 11.7 | 4.3 27 
2, 44 | 47.1 51.8 | 44.3 44.9 53.7 56.8 | 9.4 | 3.6 
3 149 | 49.7 56.7 | 48.3 39.2 | 48.8 46.3 10.5 | 3.1 | .68 
4 | 265 47.7 49.4 48.9 44.3 51.9 48.3 7.6 | 2.0 | .S2 
5 130 | 44.1 39.6 | 41.4 44.3 34.7 40.9 9.6 2.9 | 
6 72 63.0 65.0 | 74.1 69.0 | 67.6 67.9 11.1 3.1 | .39 
7 65 52.5 45.8 47.2 47.6 43.0 46.8 9.5 2.3 
8 79 49.1 42.7 55.8 45.6 | 44.3 | 47.2 13.1 | 3.9 51 
a 201 60.5 53.1 60.9 53.4 | 51.9 55.6 9.0 | 3.7 71 
10 64 36.1 44.9 44.1 45.8 54.9 44.6 17.9 4.0 53 
11 44 60.8 66.5 40.2 49.3 46.5 51.3 26.3 | 8.5 91 
12 54 55.7 49.0 44.2 44.3 49.0 48.1 11.5 | 3.4 
13 104 42.1 46.0 47.3 47.8 44.2 45.7 5.7 | 1.8 -.17 
14 50 48.2 52.9 48.3 55.0 49.2 50.7 6.6 | 2.6 
15 55 42.1 55.8 46.6 41.2 55.7 48.6 14.6 6.0 .67 
16 48 41.7 44.6 47.0 49.7 45.1 45.7 8.0 2.2 -.38 
17 42 52.6 43.3 47.2 43.2 43.3 45.7 9.4 3.1 
18 37 46.7 | ‘60.3 66.6 47.6 55.7 55.5 19.9 6.6 59 
19 36 44.2 53.1 61.1 54.6 53.5 52.9 16.9 3.9 25 
20 251 47.7 48.1 54.5 60.3 49.0 52.4 12.6 4.5 1.12 
21 52 51.4 42.8 44.8 45.2 48.6 46.2 8.6 2.7 






































1. Date furnished by Professor V. A. C. Henmon. 
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TABLE I (Continued) 








Mean Intelligence Percentile Measures of Variability 
1930 | 1951 1985 Range A.D. 
sal 2 = } 
33.9 | 
35.4 
46.7 
44.4 
55.5 
42.7 
46.3 
34.35 
34.0 
49.9 
52.1 
53.4 
52.8 
58.4 
53.6 
65.0 
55.6 
58.9 
49.1 
49.3 
60.4 
55.2 
45.2 
55.9 
63.4 
57.7 
66.1 
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TABLE II 
THE MEAN INTELLIGENCE AND VARIABILITY FOR 20 COLLEGES 





___ Mean Te test | Percentile " Measures of Variability 


r 1929 A 1930 | 1931 ] | 1933 33 | Total | Range | A.D. : 














69.9 | 72.7 

67.3 | 71.7 

65.8 | 78.1 
| 


ou 
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-.08 
1.44 
46 
- 46 
-.06 
-98 
-59 
1.66 
-.04 
44 
-99 
-.09 
- 46 
74 
-.38 
229 
-17 
-42 
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65.8 72.5 
65.0 60.7 
61.2 68.2 
60.1 67.3 
59.2 74.7 
57.7 48.3 
56.1 69.0 
54.7 
53.7 
53.2 
51.5 
49.6 
49.4 
49.3 
47.8 
45.8 
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TABLE III 






THE DISTRIBUTION OF MEAN PERCENTILE SCORES FOR 108 HIGH SCHOOLS 













































eee - , 
Percentile scores 1929 1930 4 1931 1932 1933 Total 

80 - 84 1 
75 - 79 1 1 
1 - 74 2 2 
65 - 69 4 6 3 5 2 3 
60 - 64 11 10 7 3 ? 5 
55 - 59 16 8 16 12 14 16 
50 - 54 | 23 22 =O 14 19 19 18 
45 - 49 | 20 19 | 32 34 33 41 
40 - 44 2 25 23 17 24 18 
35 - 29 | 6 13 6 14 m2] 6 
30 - 34 2 4 | 4 1 2 l 
25 - 29 1 2 

—" - aol ~ + — > } ; 
Wamber 108 108 108 108 108 108 
Range 26.9-73.4} 30.6-76.7| 32.3-80.0 | 27.6-78.4 | 33.7-67.6 | 33,1- 68.2 
Mean 50.69 49.12 | 49.77 48.52 | 48.98 | 49.54 
Sigma L 8.83 9.35 i. 6.98 | 8.60 | 7.30 6.94 
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THE DISTRIBUTION OF MEAN PERCENTILE SCORES FOR 20 COLLEGES 












































se ee nae — T ie T vy 
Percentile scores 1929 1930 1931 | 1932 | 1933 | Total 
—— $$} —- + - + 
80 - 84 1 
75 - 79 1 1 
70 - 74 1 3 5 5 4 
65 - 69 5 1 3 2 3 3 
60 - 64 2 1 5 3 2 
55 - 59 3 7 5 4 8 
50 - 54 4 2 6 2 3 3 
45 - 59 5 1 
—_ Re ae i + ' 
Number 19 5 20 20 20 20 
Range 45.8- 69.9 50.2-81.0 | 51.7-75.6 | 48.3-78.1 | 52.1-73.4 
Mean 57.007 61.28 60.98 63.95 | 63.47 61.62 
Sigma 6.78 i. 9.07 | 7.38 | 8.35 | 7.40 

































was from 73.4 to 26.9. The corresponding colleges as well as between individuals. 







range for the colleges is from 69.9 to Certainly the curriculum and methods of 
45.8. Similar ranges are found in the teaching camnot be the same in two schools 
other years. There are a few high schools graduating classes with means of 26.9 and 
in the state in which approximately half 73.4 nor can students from these two 

the graduating class is in the lower schools be accepted as equally promising 
fourth of the total high school senior college material. These differences are 





Class population. On the other hand there so large that ranks in graduating class 
are some high schools in which approximate- | and probably average high school grades can 
ly one half the graduating class is in the mean little unless the mean for the class 
upper fourth of the total population. Such is known. A student may be at the top of 
differences indicate the need of recogniz- the class in one school while if he had 
ing differences between high schools and gone to another school he might have 





















1. All means and sigma given in this table were computed from ungrouped date. 
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occupied an entirely different position. 
The differences between colleges are not as 
large as between high schools because the 
selection of college students is even more 
rigorous than high school graduation. How- 
ever, there is considerable variation in 
this respect since some colleges admit stu- 
dents having the same percentile score as 
is found in the total high school senior 
class and some others in which the mean is 
at or above the seventy-fifth percentile 
point. The quality of work which can be ex- 
pected of students in such colleges cannot 
be the same, 

In connection with these differences 
one might ask if size of graduating class 
or of school bears any relation to level of 
intelligence. Henmon and Holt+ found for 
the 1929 data a statistically significant 
difference in favor of schools graduating 
forty or more seniors as compared to those 
schools graduating between 10 and 39 sen- 
iors. Skewes and Wilson® found no correla- 
tion between mean percentile score and size 
of class using data from 1929, 1930, and 
1931 and an unselected group of one hundred 
high schools. 

Since only schools graduating at least 
thirty seniors per year have been consid- 
ered in this study direct comparisons with 
these two studies cannot be made. The cor- 
relation found between the means for the 
five years combined, total means in Table I, 
and the mean size of class for the one hun- 
dred and eight schools used in this study 
is .44, This is large enough to have some 
Significance. The scatter-diagram from 
which this correlation was computed showed 
many schools with less than the average 
number in graduating class to be above the 
average in mean percentile score but few 
schools of above average size which were 
below the average in mean percentile score. 

The second major problem of this study 
is the determination of the variation be- 
tween classes within schools. To what ex- 
tent do classes graduated from the same 
school vary? To what extent do classes 
admitted to the same college vary? Few 
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studies have been made of this type of va- 
riation due to the lack of data for succes- 
sive years or classes and the lack of an 
adequate statistical technique for the 
analysis of such variation. The usual 
technique for determining the significance 
of differences is inadequate since in this 
problem we are testing the significance of 
a series of differences rather than one dif- 
ference. 

One measure of this variability is the 
correlation between means of successive 
classes. Several studies using data from 
Wisconsin high schools have used this meas- 
ure. Skewes and Wilson found correlations 
of .27, .24, and .40 between the mean per- 
centile scores for 1929 and 1930, 1929 and 
1931, and 1930 and 1931, respectively. 
Table V gives the correlations between suc- 
cessive medians for all Wisconsin high 
schools. Tables VI and VII give the cor- 
relations of medians of schools graduating 
less than forty seniors and forty or more 
seniors. 


TABLE v® 


THE CORRELATION BETWEEN MEDIANS OF SUCCESSIVE 
SENIOR CLASSES, ALL SCHOOLS 





1930 1931 loze 
1929 -363 (434)* 

















-303 (427) -291 (438) 
1930 -390 (438) -363 (448) 
1931 .452 (447) 
TABLE VI 


THE CORRELATION BETWEEN MEDIANS OF SUCCESSIVE 
SENIOR CLASSES, SCHOOL WITH 
LESS THAN 40 SENIORS 




















1930 1931 1932 

1929 -318 (319) +232 (313) +221 (321) 

1930 -332 (322) -302 (331) 

1931 -393 (326) 
TABLE VII 


THE CORRELATION BETWEEN MEDIANS OF SUCCESSIVE 
SENIOR CLASSES, SCHOOLS 
WITH FORTY OR MORE SENIORS 











1931 





1930 ity |-s0a" 
1929 -582 (113) -600 (113 -604 (114 
1930 731 (116 -662 (116 
20 














l. Op. cit., p. 26. 
2 


- George Skewes and G. Alvin Wilson, "A Study of the Consistency of High Schools in Graduating Pupils Adapted for 


College Work", unpublished material. 


5. Data for Tables V, VI, and VII were furnished by Professor V. A. C. Henmon. 
4. Figures in parentheses indicate the number of schools on which the correlation is based. 
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The correlations for the one hundred 
and eight schools involved in this study 
are given in Table VIII. The correlations 
for the high schools are consistently high- 
er than those found by Skewes and Wilson. 
This is probably due to the difference in 
size of schools. Skewes and Wilson used an 
unselected group of one hundred schools 
containing some schools with as few as five 
seniors. 

The correlations for the twenty Wis- 
consin colleges are given in Table IX. Since 
the data for all colleges was not collected 
in 1930 this year is omitted from these 
calculations. These correlations are 
slightly higher than the corresponding cor- 
relations for the high schools and indicate 
that colleges admit classes of more nearly 
the same ability from year to year than 
high schools graduate. 


TABLE VIII 


THE CORRELATION BETWEEN MEAN INTELLIGENCE 
PERCENTILES FOR 108 HIGH SCHOOLS 





1930 
-609 


1931 
597 
733 


1932 
-543 
-669 
- 736 


1933 
241 
-504 
-631 
579 




















TABLE IX 


THE CORRELATION BETWEEN MEAN INTELLIGENCE 
PERCENTILES FOR 20 WISCONSIN COLLEGES 





1931 
-79 


1932 
77 
84 


1933 
-68 
-79 
-78 





1929 
1931 
1932 














While these correlations between 
names of successive classes give some indi- 
cation of the variability from year to year 
a measure giving a more precise statement 
is needed. Three additional measures have 
been used in this study. These measures 
are: (1) the largest difference or the 
range of the yearly means, (2) the average 
absolute deviation of the yearly means 
around the total means, and (3) Fisher's z 
function which is one half. the difference 
between the natural logarithms of an esti- 
mate of the variance between classes and an 
estimate of the variance within classes. 
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The largest differences between means are pre- 
sented in Tables I and II under the heading 
"Range." Table X gives these differences in a 
frequency distribution. The rangp of these differ- 
ences is from 3.2 to 33.5 for the high schools 
and from 3.8 to 15.9 for the colleges. The 
range of differences is much less for the col- 
leges and the mean difference is also less for 
colleges, This confirms the results of the cor- 
relations between successive classes. Colleges se- 
lect more nearly the same type of classes 
from year to year than high schools gradu- 
ate. It is difficult to establish a point 
above which these differences are signifi- 
cant but it would probably not be larger 
than twenty. Classes which varied in mean 
percentile score by twenty points would ap- 
pear to be significantly different. 


TABLE X 


THE DISTRIBUTION OF LARGEST DIFFERENCES 
FOR 108 HIGH SCHOOLS AND 20 COLLEGES 





Differences | High Schools 








33 - 35 1 
30 - 32 
27 - 29 
24 - 26 
21 - 23 
18 - 20 
15 - 17 
12 - 14 17 
9-1ll 32 


: 
_ Colleges _ 
| 
| 
| 


6-8 26 
3-5 9 





108 20 
12.21} 9.721 
5.91) 3.91) 


Number 
Mean 
Sigma 














The second measure of variation from 
year to year is the average absolute devia- 
tion of the yearly means around the total 
mean. These deviations are given in Tables 
I and II under the heading "A.D." They are 
given in a frequency distribution in Table 
XI. These tables reveal a wide range in 
average absolute deviation. The most va- 
riable high school has an average absolute 
deviation of 9.1 while the least variable 
has an average absolute deviation of only 
1.0. The corresponding figures for the 
colleges are 6.5 and 1.2. The mean average 
deviation for the high schools is 3.74 and 
more than one half the schools fall 





1. Computed from ungrouped data. 
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between 2.0 and 4.0. Comparison of these 
deviations for high school senior classes 
and college freshman classes indicates that 
college freshman classes are less variable. 


TABLE XI 
THE DISTRIBUTION OF AVERAGE ABSOLUTE 


DEVIATIONS FOR 108 HIGH SCHOOLS 
AND 20 COLLEGES 














Average absolute | High Schools Colleges 
deviations 
9.0 - 9.4 1 
8.5 - 8.9 2 
8.0 - 6.4 1 
7.5 - 7.9 2 
7.0 - 7.4 2 
6.5 - 6.9 2 | 
6.0 - 6.4 3 1 
5.5 - 5.9 2 2 
5.0 - 5.4 5 
4.5 - 4.9 “ | 
4.0 - 4.4 5 | 1 
3.5 - 3.9 14 6 
3.0 - 3.4 18 1 
2.5 - 2.9 16 1 
2.0 - 2.4 | 18 | 3 
1.5 = 1.9 5 3 
1.0 < 1.4 4 2 

= a +—__—__—_—___—_— 

Number 108 20 
Mean 3.75 | 3.232 
Sigma 1.74 | 1.531 





It is very difficult to interpret 
this measure of variability except in com- 
paring one school with another. An average 
absolute deviation of a given size may be 





due to this deviation in each class or to a | 


very large deviation in one or two classes. 
Practically the second situation would be 
the most significant. 

As one would expect the average abso- 
lute deviation is related to the size of 
the classes. For the hich school data the 
correlation between these two measures is 
-.45. Inspection of the scatter-diagram on 
which this correlation was computed sug- 
rested the possibility of a non-linear re- 
lation. However, since the correlation ra- 
tio was only .47 the product-moment corre- 
lation can be used to represent this rela- 


- Computed from ungrouped data. 


me 


pp. 178-215. 


. Re Ay Pisher, Statistical Methods for Research torkers. 
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| population. 
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tionship. This correlation would probably 
have been even higher had smaller schools 
been included. 

The question of the relation between 
the average absolute deviation and the total 
mean was also found. This correlation is 
.08. There is no tendency for schools on 
one level to vary more from year to year 
than others. 

The third measure of variability em 
ployed is derived from an estimate of the 
variance within years and an estimate of 
the variance between years. This method 
for the analysis of variance into two or 
more parts was developed by Fisher*® and its 
application to this particular has been sug- 
gested by Reitz.5 

This technique tests the significance 
of the difference between the part of the 
total variance which is due to variation of 
individuals within each successive class 
and the part due to variation in the means 
of successive classes. In case the part of 
the total variance contributed by the va- 
riation between classes is significantly 
larger than that part due to the variation 
within classes the successive classes can- 
not be considered as coming from the same 
population. Or, stated in different terms, 
in such a case the system of means could 
not be considered as arising by drawing 
successive samples from the same parent 
Some factor other than chance 
must have been operating. The value whose 
significance is tested is not the differ- 
ence between these two estimates of the va- 
riance but one half the difference between 
the natural logarithms of these estimates. 
This quantity is known as (z). In case the 
number of degrees of freedom corresponding 
to the larger variance, (n,), and the num 
ber of degrees of freedom corresponding to 
the smaller variance, (n,), are large or 
moderate in size and nearly equal the sig- 
nificance of (z) can be determined by re- 
latins it to its standard deviation. Since 
in the data for the high schools one of the 
degrees of freedom will be four and the 
other will be not less than 180, this test 





Oliver and Boyd: Edinburgh, (1920), Third Edition, 


5. Wilhelm Reitz, "Statistical Techniques for the Study of Institutional Differences", Journal of Experimental Educa- 


tion, III (September, 1924), pp. 
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of significance cannot be used. Fisher 
vives tables! which permit an interpreta- 
tion of the significance of (z) in terms of 
probability. These tables give (z's) cor- 
responding to probabilities of .05 and .01 
for certain values of n, and n,. The z 
corresponding to a probability of .0l, mn 
of 4, and n, of 60 is .6472. The corre- 
sponding 2 for an n, of 4 and n>p infinitely 
large is .5999. The number of degrees of 
freedom corresponding to the smaller va- 
riance, n,, involved in the data of this 
study do not occur in these tables. For 
this reason some arbitrary limit mst be 
set since it would be impractical to com 
plete these tables for all the different 
no's involved. The point z = .6472 has 
been arbitrarily chosen. The probability | 
of a z of this size arising through chance | 
is not far from .01. In terms of the prob- | 
| 
| 
| 
| 





lem of this study a school in which (z) 
equals or is greater than .6472 cannot be 
considered as drawing its senior classes 
from the same parent population. Or in the | 
usual terms, the differences between the 
successive means are statistically signifi- | 
cant. 
Before presenting the data on this 
point it is necessary to consider a limita- | 
tion of the application of this technique 
to these data. This technique assumes a 
normal distribution of the variate in the | 
population from which the samples are drawn. 
The variate studied in this investigation 


has been measured in percentile units which | 


yield a rectangular distribution rather 
than a normal distribution. This fact un- 
doubtedly has affected the results probably 
in the direction of exaggerating the dif- 
ferences. The results must be accepted as 
tentative until this test of significance 
has been revised to take care of rectangu- 
lar distributions. 

The z's for the most variable high 
schools are given in Table I and for the 
colleges in Table II. The 2's have been 
computed for those schools in which the va- 
riation seemed by inspection to be signifi- 
cant and a few others. If one accepts the 
point .6472 as the limit above which dif- 
ferences between successive means are sig- 
nificant twenty-eight of the one hundred 


' 
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and eight schools are more variable from 
year to year than would be expected from 
chance. In these twen'y-eight schools suc- 
cessive classes cannot be considered as 
drawn from the same population nor can the 
total mean be used as a measure of the abil- 
ity of students from these high schools. 
There are a few negative values of z given 
in Tables I and II. This indicates that 
the estimate of the variance between class- 
es is less than the estimate of the vari- 
ance within classes. None of these nega- 
tive values are larger than would be ex- 
pected through chance. 

There are six of the twenty colleges 
which have z's greater than .6472. These 
freshman classes cannot be considered as 
drawn from the same population. 

A word of caution should be given in 
interpreting these results. This technique 
gives only an indication of the statistical 
Significance of certain values of Z. The 
probability of getting a z of .6472 or 
larger through chance is less than .Ol. 
This does not tell one much about the prac- 
tical significance of the variation. School 
Number 108 has a Z value of 1.02 and yet 
the variations from year to year are rela- 
tively small and probably not of any prac- 
tical importance. School Number 10 ha 
z of only .53 and yet its variation from 
year to year is larger when measured bott 
by range or largest difference and average 
absolute deviation. There is no reason why 
a certain deviation should be considered 
practically significant in a large school 
but not in a small school. The conclusion 
one can reach regarding these twenty-eicht 
high schools and six colleges is that the 
means of successive classes vary so widely 
in comparison to the variability within 
classes that some factor or factors other 
than chance have been operating. 

Earlier in this study it was shown 
that the variation in means of high schools 
is very large. In 1929 these means varied 
between 26.9 and 73.4 and the means for the 
total five-year period varied between 33,] 
and 68.2. It was pointed out that ranks in 
graduating classes need some adjustment to 
take account of this variation. At the 





l. Op. cit., pp. 212-215. 
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university success are made on the basis of 
rank in graduating class and the percentile 
scores used in this study. The middle stu- 
dent coming from high school number 31 
which has a total mean percentile score of 
33.1 would have the same rank in graduating 
class as the middle student coming from 
high school number 43 which has a total 
mean percentile score of 68.2. These stu- 
dents cannot be equally promising college 
material. Reitz] has suggested and ap- 
plied a technique for correcting ranks in 
graduating classes and also mean high 
school grades for this variation between 
schools.’ He points out, however, that such 
a technique is dependent on the variabil- 
ity from year to year within the same 
school. Since the z's in twenty-eight of 
the schools employed in this study are so 
large that the total means cannot represent 
these schools the adjustment could not be 
applied to them. 

If such an adjustment of ranks is 
needed one would expect to find that the 
predictions of university success would be 
too high for students coming from high 
schools with low total means and too low 
for students from high schools with high 
total means. Table XII indicates that this 
is the actual situation. In every year the 
predictions of university success in terms 
of grade point averages are too low for the 
students coming from the thirty high schools 


| vary between 33.1] and 68.2. 
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with the highest total means. In every year 
but one the predictions for students coming 
from the thirty schools with the lowest to- 
tal means are too high. Since the number 
of students coming to the University of Wis- 
consin from these schools after excluding 
those from schools in which the variation 
from year to year is larger than would be 
expected from chance is so small this ad- 
justment could not be made. 

In conclusion the following statements 
can be made. In the sample of one hundred 
and eight Wisconsin high schools there is 
considerable variation in mean percentile 
score between high schools. The means for 
the senior classes over a five-year period 
The total means 
for the freshman classes in twenty Wisconsin 
colleges vary between 52.1 and 73.4. These 
variations are related to size of school as 
is indicated by a correlation of .44 between 
the total means and mean size of graduating 
class. In general the larger schools have 
the larger means. The need for an adjust- 
ment of ranks in graduating on the basis of 
these total means has been indicated. Four 
measures of variability within schools have 
been presented. These measures are: The 
correlation between means of successive 
classes, largest difference between means, 
average absolute deviations of the yearly 
means around the total means, and Fisher's 
z function. The correlation between 


TABLE XII 


ERROR IN PREDICTED GRADE POINT AVERAGES 


























| ¢ Predicted Actual 
Number | Mean nae | Mean High Grade Point | Grade Point | Error 
| tile Score | School Rank | Average | Average 
High Group | | | | 
1929 354 59.37 | 60.15 | 1.152 | 1.184 | -.032 
1930 367 62.56 | 62.008 | 1.24 | 1.391 | =0145 
1931 283 64.04 | 62.52 1.284 1.347 | =.063 
1932 298 53.49 | 60.19 1.047 1.245 | -.198 
1933 270 | 54.74 64.18 1.147 1.217 -.070 
} ote WERE > ge ++ 
Low Group 
1929 41 | 52.78 | "97.39 _ 1.373 1.012 +.361 
1930 52 52.41 | 74.22 1.304 1.142 | +,162 
1931 40 | 45.68 70.88 1.117 1.148 | -,031 
1932 51 53.91 | 76.76 1.381 1.094 | +.287 
_ _ 1933 55 50.69 | 75.94 1.305 1.259 | +.046 


























1. Wilhelm Reitz, "A Method of Adjusting School Marks and Class Ranks to Compensate for Variations in Standards of 
Marking", Bulletin of the American Association of Collegiate Registrars, IX (October, 1923), pp. 1-14. 





2. These percentile ranks are those computed on the basis of college students rather than the state senior class 
population. 
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successive classes was found to range be- 
tween .579 and .736. Corresponding figures 
for the college freshman classes are .84 
and .78. The mean largest difference for 
the high schools is 12.21 and 9.72 for the 
colleges. There is a much wider range of 
largest differences for the high schools 
than for the colleges. The mean average 
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absolute deviation for the high schools is 
3.75 and 3.23 for the colleges. On the 
basis of these three measures college 
freshman classes vary less from year to 
year than do high school senior classes. 
Twenty-eight high schools and six colleges 
have variations between classes which would 
occur by chance in less than one out of one 
hundred cases. 














FIVE YEARS OF EXPERIENCE WITH A REMEDIAL READING 
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COURSE FOR COLLEGE STUDENTS? 


by 


J. R. Gerberich 


The first re-training course in reading 
and methods of study at the University of 
Arkansas was introduced in 1928-29. During 
that year approximately forty probation stu- 
dents were given a course of this nature for 
eighteen weeks extending from the middle of 
the first semester to the middle of the sec- 
ond semester. Since that year approximately 
fifty freshmen whose reading abilities were 
found to be very deficient have been re- 
quired to take remedial work during the 
first semester of each year. 

Scores made by the freshmen on the 
reading and psychological examinations of 
orientation week were used as the major cri- 
teria in selection of students for the spe- 
cial course after 1929, with scores on the 
reading test as the more important of the 
two. Both the Study Performance Test, pub- 
lished by Ohio State University, and the 
American Council Psychological Examination 
were used with the entering classes of 1929 
and 1930. The Ohio State University Psycho- 
logical Examination has been used since 1930 
to give a measure of general scholastic ap- 
titude, while Part III of that examination, 
a reading test, has furnished a reading ex- 
amination score. A few students who ranked 
exceptionally low on both the reading and 
psychological examinations were not included 
in the course because it was felt that their 
chances of success in college were almost 
hopeless. College entrants markedly over 
the average age for freshmen were excused 
and a few others were excluded for adminis- 
trative reasons relating to their curricula, 
schedules, and the like. 


University of Arkansas 





LDr. C. E. Prall, Dean of the College of Education of the University of Arkansas, has made many helpful suggestions and 
has contributed much of value in the preparation of this article. 


From its inception, the special reading 
and methods of study course was given for 
two separate hours weekly without credit. 


The schedules of most of the students were 


reduced to thirteen or fourteen hours of 
credit work, although in a few cases the 
special course was superimposed on a normal 
schedule of fifteen or sixteen hours. The 
students were sectioned in two groups number- 
ing approximately twenty-five each, thus per- 


mitting some attention to the problems of the 


individual during the class period. Confer- 
ences, analyses of results obtained from oc- 
casional tests, reports of scholastic stand- 
ing, and other available types of information 


were utilized in an attempt to adapt the work 


to the needs of the individual. 

Although the class procedure varied con- 
siderably, with new materials and methods in- 
troduced from year to year, the method of 
conducting the course remained relatively 
constant in the important elements. Varying 
types of reading exercises, accompanied by 
short check-tests, formed the major portion 
of classroom procedures, Discussion of the 
results of these exercises and comments by 
the instructor were interspersed with ob jec- 
tive test exercises of standardized and un- 
standardized types. The units of the course 
dealt with such reading and study skills and 
abilities as word meaning, sentence meaning, 
paragraph comprehension, comprehension of 
longer selections, obtaining the main idea of 
selections, summarizing short thought units, 
outlining, taking reading and class notes, 


interpreting graphs, and increasing reading 
speed. 
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By the use of a control group, it was 
possible to obtain objective evidence con- 
cerning the efficiency of the remedial read- 
ing course for probation students during the 
year 1928-29.’ Probation students with scho- 
lastic and intelligence ratings comparable 
with those of students required to take the 
special reading work were selected for the 
control group. The two groups were compared 
on the following criteria: (1) final scores 
on an objective reading examination given 
at the close of the experiment, (2) scholas- 
tic standing at the close of the experiment 
and throughout the following school year, 
and (3) persistence in the University for 
more than a year following the close of the 
experiment. 


J. R. Gerbderich 





Scores made by experimental and control 
students on the initial and final presenta- 
tion of an objective reading test showed 
that the experimental students made twice as 
great a gain as the control students. Scores 
made by the two groups on an outlining and 
note-taking test, which made up a portion of 
the final measure, showed the experimental 
group to surpass the control group by a dif- 
ference of 3.4 times the probable error. 

The experimental students raised their 
scholastic standing 0.32 grade points® dur- 
ing the eighteen weeks of the experiment, 
while the control students lost 0.08 grade 
points during the same period. During the 
year following the experiment, students of 
the experimental group averaged 0.29 grade 
points higher than the control students, 
while carrying on average of 1.2 more hours 
of course work. 

At the close of the experiment, 84 per 
cent of the experimental group and 65 per 
cent of the control group were still in 
school. Basing further persistence data on 
thirty-one experimental and thirty control 
students who completed the second semester 
of the year of the course, it was found that 
42 per cent of the former and 30 per cent of 
the latter persisted long enough to enroll 
for the first semester of 1930-31. 

In order to measure the efficiency of 
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it was necessary to establish some type of 
control group for comparison with the spe- 
cial reading group. The achievement meas- 
ures used as indices of final attainment 
were general scholastic success for the 
first and subsequent semesters of attendance, 
percentages of students persisting varying 
numbers of semesters in attendance, percent- 
ages of students placed on probation because 
of poor scholarship, percentages of students 
dropped from their colleges of registration 
because of poor scholarship, and percentages 
of withdrawals during the progress of a se- 
mester. Table I presents evidence concern- 
ing the validity of scores on the reading 
and psychological examinations for use in 
establishing a control group to be compared 
with the experimental group. 


TABLE I 


COEFFICIENTS OF CORRELATION BETWEEN GRADE POINT 

AVERAGES FOR THE FIRST SEMESTER OF UNIVERSITY AT- 

TENDANCE AND SCORES ON THE READING AND PSYCHOLOG- 
ICAL EXAMINATIONS 


eS 


| 
|G.P.A. vs. Reading | G.P.A. vs. Psych. 








Year| ___Exam.* | __—sCEXam.<## 
[No. of | | | No. of | 

Cases |r | PLE. | Cases e | PB. 
oa Ps em 
/1g29| 2oseee | +47 | .03 | 436 46 | .03 
1930| 254**# | .62 | .03 | 369 | 66 | .o2 
/1931| 392 §©=©| .46| .03 | 392 | .55 | .02 
| 1932| 365 | 56 .02 | 365 | .62 | .0o2 














*The tests used were: 1929 and 1930 - the 
Study Performance Test, Form A, published by 
Ohio State University; 1931 and 1932 - Part III 
of the Ohio State University Psychological Ex- 
amination. 

*#*The tests used were: 1929 and 1930 - the Amer- 
ican Council Psychological Examination; 1931 
and 1932 - the Ohio State University Psycholog- 
ical Examination. 

*##As students entering the College of Engineering 
were not required to take the Study Performance 
Test, these correlations involve fewer cases 
than those based on the psychological examina- 
tion. 


The degrees of relationship indicated 
by the coefficients between the two initial 








the remedial reading course given since 1929,' measures and the most important of the final 





° t of data which are briefly summarized here are as follows: r 
* i teen Unpublished, 1929, 7 pages, and J. F. Gerberich, "Subsequent 


of a Re-Training Experiment with Probation Students", 


University Careers of Probation Students in the Re-Training Experiment of 1928-29", 
2. A weighted grade point average based on the following system of weighting semester marks: 


and F -2, 


C. E. Prall and G. N. Cade, "The Results 


Unpublished, 1951, 5 pages. 
A6,B4,C 2,00, E-1l, 
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criteria warranted the use of the reading 
and psychological test scores for the pur 
pose indicated. The relationships between 
scholastic success and the other criteria of 
final achievement--persistence, probation 
status, dismissals for poor scholarship, and 
voluntary withdrawals--are all sufficiently 
close to warrant the use of the same equat- 
ing factors for all final comparisons. 

The students comprising the special 
reading class have been drawn from those 
ranking in the lowest twelve to fourteen per 
cent on the reading test and largely from 
the lowest fourth of the psychological test 
ranking. Since nearly all the students 
scoring in the lowest twelve to fourteen per 
cent on the reading test were required to 
take the remedial course, control students 
were not available for close equation with 
special reading students in the attempt to 
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The data of Table II show that the con- 
trol group in all cases surpassed the exper- 
imental group in mean scores on both the 
reading and the psychological examinations. 
Furthermore, that the differences were sta- 
tistically significant is shown by the fact 
that they were all more than four times their 
probable errors. 

The usual procedure in a controlled 
group comparison is to look to differences in 
final achievement between two initially- 
equated groups for evidence concerning the 
relative efficiencies of the experimental and 
control techniques. As has been indicated 
above, a close matching of experimental and 
control students was not possible, so the two 











‘groups were established without any attempt 
ito match individual students. Here a signif- 
| icant gap existed between the ability levels 


|of the two groups. If, in the final meas- 


measure the efficiency of the remedial train-|ures, the disparity between the achievement 


ing. In lieu of exact controls, therefore, 
students were selected for the comparison 
whose scores on the reading test were just 
above the scores made by the students of the 
special course. Their rankings on the read- 
ing test were in the main from the twelfth 
to the twenty-third percentiles, although a 
few control students would originally have 
been included in the special reading group 
except for some difficulty in arranging the 
schedules or like matters. 
and women was kept constant for the two 
groups. Table II gives the status of the 
experimental and control groups on both the 


of these groups is significantly less marked 


| than the difference which existed between 
— in the beginning, it may well be assumed 
(that the instructional efficiency of the ex- 
| perimental technique has been effectively 
demonstrated. 

The two most significant measures of fi- 
‘nal status for this study are general scho- 
‘lastic success, as measured by grade point 
|averages, and persistence in the University. 


The ratio of men |The first of these is dealt with in Table III. 


Only those students who completed at least 
one semester of attendance or a sufficient 
portion of it to have final semester marks on 

















reading and psychological examinations. record are included in the tabulations. The 
TABLE II 
INITIAL STATUS OF CONTROL AND EXPERIMENTAL GROUPS ON 
THE READING AND PSYCHOLOGICAL EXAMINATIONS 
Control Experimental | 
Examination | Year| No. Mean Mean Diff. 
Score P.E, No. Score P.E. Diff. P.E.asee P.E.airr 
Reading 1929 | 59 | 111.27 + 1.46 | 48 86.15 + 1.53 | 25.12 2.12 11.85 
1930 | 54 95.47 + 1.73) 45 71.34 + 1.28 | 24.13 2.15 11.22 
1931 | 55 20.27 + 0.29 | 48 15.83 + 0.29 4.44 0.41 10.83 
1932 | 57 22.16 + 0.30 | 47 15.75 + 0.27 6.41 0.41 15.63 
Psych. 1929 | 59 92.81 t 2.44 | 48 76.38 + 1.87 | 16.43 3.08 5.33 
1930 | 54 96.17 + 3.68 | 45 65.50 + 2.10 | 30.67 4.24 7.23 
1931 | 55 67.46 + 1.59 | 48 52.19 + 1.79 | 15.27 2.40 6.26 
1932 | 57 67.98 + 1.52 | 47 47.24 + 1.01 | 20.44 1.82 11.23 





























ee 





is hie ee ARE elas 


fir 
sem 
and 
in 
sec 
cor 
con 
anc 
de! 
se’ 
mo’ 
sh 
tr 
st 
be 








ag 
sd 
a 
3 
er 
& 
5 
= 
Pd 
a 
:. 
i 
Ey 
Ly 
; 
= 
t 
bg 
Ee 
& 
z 
a 


September, 1934 


first semester data are based on the first 
semester of attendance for the 196 control 
and 165 experimental students who remained 
in the University at least that long; the 
second semester data are based on the 115 
control and 92 experimental students who 
completed at least two semesters of attend- 
ance, etc. The decrease in number of stu- 
dents in attendance from the first to the 
seventh semesters is due both to student 
mortality and to the time limitation which 
shortened the attendance records for en- 
trants of recent years. Thus the number of 
students as reported in the table should not 
be taken as an index of persistence. 
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Information concerning the persistence 
of experimental and control students by col- 
leges of initial registration is given in 
Table IV. The students entering in each of 
the four years are again combined for the 
first, second, and subsequent semesters of 
attendance. The "Total" column in each sec- 
tion of the table shows the number of stu- 
dents who had sufficient time since entrance 
to put in the designated number of semesters 
in regular-session attendance. The final 
columns of each section give the percentages 
of students actually in attendance each num- 
ber of semesters relative to the total num- 
ber of those who could have attended that 


TABLE III 


SCHOLASTIC SUCCESS OF CONTROL 


AND EXPERIMENTAL STUDENTS IN 


RELATION TO PERSISTENCE IN THE UNIVERSITY 








Semester Control 
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Attendance G.P.A. P.E. 
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An examination of the mean grade point 
averages of Table III shows that the control 
group ranks consistently higher than the ex- 
perimental group. In only one instance--the 
first semester of attendance—-is the differ- 
ence a significant one, however. Therefore, 
it may be concluded that those control stu- 
dents who persisted more than one semester 
in attendance did not attain grade point av- 
erages differing from those of the compara- 
ble experimental students by a statistically 
significant amount. It is recognized, of 
course, that the two groups are not here 
paired exactly in terms of the data reported 
in Table II, for the withdrawals of some 
students had destroyed exact comparability. 
However, the same factors were in operation 
to affect withdrawal and persistence in the 
two groups, so that the conclusions drawn 
from the data of this table seem fair to each 
group. 


period of time. 

It is evident from the data of Table IV 
(see following page) that the experimental 
students remained to complete the first se- 
mester in a slightly greater percentage of 
cases than did the control students. Per- 
sistence is greater for the control group 
during the following four semesters. The 
maximum difference between percentages of 
the total remaining in attendance is the 10.2 
per cent in favor of the control group for 
persistence of at least four semesters. For 
persistence beyond four semesters, the num- 
bers of cases are hardly adequate as a basis 
for more than tentative conclusions, but it 
appears that what differences do exist are 
at least not unfavorable for the experimental 
group. 

In Table V (see following page) are pre- 
sented data concerning the last three cri- 





teria of final achievement mentioned above: 
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THE CONTROL AND EXPERIMENTAL GROUPS 
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TTENDANCE FOR STUDENTS OF 


















T aie —— ee ee ees ve He 
Control Experimental 
} Sem. of .- Ao) ne aa tlhlmUmUlUt~™ | +n | Per cent in | k 
Attendance | Total | attendance attendance | Total | attendance | attendance | Diff.# 
_—- - +-——- —-— + — + — —— - —- _- a oo + 

First | 225 | 196 87.1 iss | s165 | 87.8 -0.7 
Second 168 115 | 68.5 141 92 65.2 3.3 
Third | 168 | 80 47.6 | Mm 57 | 40.4 7.2 s 
Fourth | 113 | 48 42.5 93 30 32.3 10.2 4 
Fifth a 33 29.2 93 22 | 23.7 5.5 . 
Sixth | so | 13 22.0 48 14 29.2 -7.2 ‘ 
Seventh | 59 | 4 | 15.3 | 48 | 7 | 14.6 |_0.7 



















*Positive differences show greater persistence for 
the experimental group. 


PROBATION AND WITHDRAWAL 





TABLE V 





the control group and negative differences for 


HISTORY FOR STUDENTS OF 








THE CONTROL AND EXPERIMENTAL GROUPS 
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Placed on Probation 51 45 96 


Dropped for Poor 
Scholarship 225 | 8 | 17 | @ 
Withdrew Voluntarily | 225 | 24 | 7 | 21 


oat Control a : Experimental : 
Group [All | First | | Per | All | First Per 
Cases Sem. Later | Total | Cent | Cases Sem. Later | Total Cent 





42.7 





188 59 





40 





99 






| 11.1 | 188 | 6 | 18 | 2% 12.8 
| 13.6 | 188 | 16 | 7 | 2 12.2 












percentages of students placed on probation, 
percentages of students dropped for poor | 
scholarship, and percentages of students 
withdrawing before the completion of a se- 
mester for which they had registered. The | 
results are presented separately for the 
first semester of the freshman year and sub- 
sequent semesters of attendance, because 
scholastic difficulties and mortality are | 
usually much more crucial and serious during | 
the first semester than at any other time. | 
The probation data of Table V show that 
} 43 per cent of the control group and 53 per 
cent of the experimental group were at one 
time or another placed on probation because 
of low scholarship. Many of the students so 
classified were subsequently removed from 

















probation; others were dropped for poor schol- 
: arship, withdrew while on probation, or are 
at | still in attendance on probation. More than 


half of the probation students in each group 
. were first placed on probation before the 
Close of the first semester of attendance. 



















Many of the remainder were placed in that 
status immediately after the close of the 
first semester. 

Eleven per cent of the control group 
and 13 per cent of the experimental group 
were dropped from their college of initial 
registrat*on because of poor scholarship. 
Some reentered later or gained admission to 
other colleges of the University. Less than 
half of this group were dropped before the 
close of the first semester, but many of the 
remainder were dropped so shortly after the 
close of the first semester that they were 
not allowed to register for the second se- 
mester,. 

Fourteen per cent of the control group 
and 12 per cent of the experimental group 
withdrew before the close of a semester for 
which they had registered and during which 
they had attended classes. 

The differences shown by these data in- 
dicate that the control students encountered 
fewer scholastic difficulties than the 
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experimental students. However, in only one| on the two initial measures—-scores on read- 
of the three comparisons--students placed on| ing and psychological examinations. There- 
probation--is the difference of sufficient fore, control students were selected whose 
size to merit attention. scores on these tests were in most cases 
just above the scores made by the experimen- 
| tal students. The differences in mean 
Summary scores made by the two groups were found to 
| be statistically significant and to show the 
The purpose of this investigation was | control students to be distinctly superior 
to obtain objective information concerning on the two initial measures. 
the efficiency of a remedial reading course | Differences in the standings of the ex- 
required of approximately fifty University | perimental and control groups on the final 
of Arkansas freshmen each year since 1929. | criteria showed a tendency toward higher 
The following factors were used as criteria | scholastic averages, greater persistence in 
of final achievement: (1) general scholastic| the University, and few scholastic difficul- 
success for the first and subsequent semes- ties for the control students than for stu- 
ters of University attendance, (2) percent- | dents in the experimental group. However, 
ages of students persisting varying numbers | these differences were found to be relative- 
of semesters in attendance, (3) percentages ily small and almost without exception lack- 
of students placed on probation because of | ing in statistical reliability. It becomes 
poor scholarship, (4) percentages of stu- | evident, then, that the control students, 
dents dropped from their colleges of regis- | showing superiority of mean scores on two 
tration because of poor scholarship, and | Sone tests of determined validity, did 
(5) percentages of students withdrawing dur- not maintain their initial superiority on 
ing the progress of a semester. | the criteria of final achievement. It may 
It was impossible to select a group of (well be assumed, therefore, that the marked- 
control students having abilities similar to | ly less disparity existing between the final 


the abilities of the experimental students | status of the two groups than that which ex- 


| isted on the initial factors is evidence of 
the instructional efficiency of the experi- 
mental technique. 








INTERCORRELATIONS OF 





Leo J. 
University 


In the Critique of the Twenty-ninth 
Yearbook of the National Society for the 
Study of Education it was proposed that 
four major functions of arithmetic instruc- 
tion should be recognized, namely, the com 
putational, the informational, the sociolog- 
ical, and the psychological functions. 

If one accepts this view that arithme- 
tic should be taught as a social study 
rather than as a tool subject, he is 
cerned with the devising of methods of 
evaluating the extent to which all of the 
desired objectives of the subject are be- 
ing achieved, 

In an attempt to make available a 
means of measuring the extent to which 
these functions are now being achieved, the 
writer devised a series of four scaled 
tests! dealing with several major aspects 
of these functions. One of the tests in- 
cludes scaled exercises (examples in mixed 
processes) in computation; the second con- 
sists, of scaled verbal problems of the con- 
ventional type; the third contains scaled 
exercises of the multiple choice type for 
measuring understanding of quantitative vo- 
cabulary, one aspect of the psychological 
function; the fourth test, called Under- 
standing of Quantitative Relationships, in- 
cludes scaled exercises of the multiple 
choice type for evaluating the ideas and 

the range of information pupils have con- 
cerning social institutions such as banking, 
money, taxation, business practices, appli- 
cations of number, etc., that are empha- 
sized by the informational and sociological 
functions of arithmetic. The last two tests 
did not contain any exercises involving com 
putation, while all of the exercises in the 
first two tests did involve computation. 


con- 
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ARITHMETICAL ABILITIES 
by 

Brueckner 

of Minnesota 


This paper presents a brief summary of 
the results secured when these tests were 
administered to a group of 453 pupils in 
grades 4A-5B in typical schools of St. Paul. 

Table I shows the variability in arith- 
metic ability on each of the four tests for 
the whole group of children. 


TABLE I 


VARIABILITY IN ARITHMETIC ABILITY OF 
453 PUPILS IN GRADES 4A-5B 











(Expressed in C-Scores) 
Mean Range S.D. 

Computation 63.9 37.5 to 91.5 | 8.6 

Problem Solving 63.6 27 to 96 10.2 

Vocabulary 64.5 | 24.5 to 103.5 /10.8 
Quantitative 

Relationships | 65.6 23 to 92.5 | 9.4 
P.E. of tests | 

approximately | +2.0 











It will be observed that the mean 
scores on all four tests were practically 
the same. However, the range in scores on 
each of the tests was very large. Expressed 
in terms of grade differences the range be- 
tween the highest and lowest scores is 
equivalent to more than five grades for 
each test. ; 

The question naturally arises, what are 
the intercorrelations among the scores on 
the four tests? The answer to this ques- 
tion is given in Table II. on the following 
page. 

The correlations between general intel- 
ligence and the various arithmetic test 
scores are all positive. They range from 
-350 between I.Q. and computation to .766 
between I.Q. and vocabulary. The range of 
the skills measured by the test of computa- 
tion is much narrower than the range of the 
learnings measured by the vocabulary test. 





1. Analytical Scales of Attainment in Arithmetic. Educational Test Bureau, Inc., Minneapolis, Minn. 
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TABLE 112 


INTERCORRELATIONS OF ARITHMETIC 
ABILITIES AND GENERAL INTELLIGENCE 
] T T 
| | Prob- | Quantita- 

Compu- | lem Vocabu- | tive 
jtation | Solv- lary | Relation- 

|_ ing 3 ships 

596 | 602 
631 417 
| .576 
| .661 








1.92 | «350 
Computation : 


- 766 
361 
-691 522 


-361 


Problem Solvin 

Vocabulary 

Quantitative 
Relationships .417 


| 2522 


| 
| 
} 
| 
z 


6. | 


——eeS 7 = 


|_.576 





®yental ability based on Von Wagenen Unit Scales 


of Apti- 
tude; mean I.C. of group .93. 


The correlation between I.Q. and problem 
solving, a more complex ability than compu- 
tation, is .506 and between I.Q. and under- 
standing of quantitative relationships, a 
trait more complex than problem solving, 
.602. The correlations with general intel- 
licence tend to increase with the increase 
in the complexity of the traits involved. 
The correlation between computational ability 
and vocabulary, .361, and between computational 
ability and understanding of quantitative rela- 
tionships, .417, are both quite low. These 
carrelations show that a pupil's rating on the com- 
putational test is a very unreliable index of his 
ability as measured by the vocabulary or quan- 
titative relationships tests. Tests of each 
trait are essential for well rounded appraisal. 
The general conclusion may be drawn 
that the more specific the traits involved, 
the lower are the correlations; the more 
complex the traits, the higher are the cor- 
relations. If tests were given to evaluate 
each of the specific outcomes that could be 
listed under an analysis of the four func- 
tions, it is probable that the correlations 
between many of the elements would be quite 
low, depending on the narrowness of the 
traits involved. 
The results here reported show that for 

particular individuals there are marked 
ifferences in the levels to which the va- 
rious traits have developed as a result of 
the learning experiences in which the pu- 
pils have participated. If the emphasis in 
instruction has been on the computational 


| cannot be left 
| cannot be regarded as a by-product of 








| Computation 








Leo J. Brueckner 


function we can be quite certain that the 
development of the traits included in the 
other functions will be uneven and unbalanced. 
To make certain that growth of all of the 
desired traits takes place, direct provision 
must be made for educative experiences in 
which they may function. Their development 
to chance circumstances and 
in- 
struction which is at present largely lim 
ited to the development of skill in computa- 
tion. 

To get a picture of the variability of 
the abilities of bright and dull children 
as measured by these four tests, the results 
for two groups of children, one consisting 
of 117 children with I.Q.'s of 100 to 129, 


| and the other consisting of 116 children 


with I.Q.'s of less than 85 were compared. 
Table III gives the means of the two groups 
on each of the four tests and the ranges in 
the scores. 


TABLE III 


VARIABILITY OF ARITHMETIC ABILITY 

IF BRIGHT AND DULL CHILDREN 

S.D. 
Bright Dull 





Means 


Ranges 
Bright Dull 


Bright Dull 


68.7 58.4 55-86 37-76 
Problem 

Solving 71.2 58.2/| 45-96 37-77 
Vocabulary 74.4 55.4 |515-98 24-71 
Quantitative Rela- 


tionships | 73.4 58.5 49-90 


22-79 





The mean scores for the bright group 
in each case were larger than for the dull 
group. However, an analysis of the ranges 
for the two groups shows a wide overlapping 
of ability. In two tests, vocabulary and 
quantitative relationships, the dull group 
appears more variable than the bright, 
while in computation and problem solving 
the bright group appears more variable than 
the dull. The only significant difference 
in variability of the two groups is in the 
test of quantitative relationships where the 
dull group varies significantly more. The 
differences between the bright and dull 
groups in specific traits would undoubtedly 


. From an unpublished master's dissertation by Virgil Walker, "Interrelations of Arithmetic Abilities", on file at 


University of Minnesota Library. 
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be increased by intensive training; how- 
ever, it seems quite clear that both groups 
would profit greatly from systematic in- 
struction dealing with the different traits 
involved in the four major functions of 
arithmetic. 

Judging from the overlapping of abili- 
ties of the two groups there is no reason 
to assume that the work in arithmetic for 
dull children need be limited to practice 
on computational processes as is now the 
common practice; the evidence is clear that 
they can profit greatly from participation 
in activities which deal with the social 
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and cultural values of arithmetic. 

The wide range in the abilities of 
these children makes it clear that instruc- 
tion must be organized in such a way that 
the difficulty of the work may be adapted 
to the needs of the individuals. Even if pv- 
pils are grouped according to their mental 
ability the type of class organization for 
both superior and inferior groups must make 
full provision for individual differences 
in view of the wide range of the abilities 
and needs of the pupils in the two 
groups. 
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RESULTS OF USE OF MACHINES FOR TESTING AND FO 
DRILL, UPON LEARNING IN EDUCATIONAL PSYCHOLOGY 
by 

James Kenneth Little 
University of Wisconsin 


The experimental study of problems of 
college teaching is a development quite 
largely of the last decade. However, these 
investigations have already yielded sugzes- 
tions for the improvement of college in- 
structional procedures which should, if ef- 
fectively combined, markedly increase the 
efficiency of college work. A still newer 
development is the invention of various de- 
vices for automatically scoring tests and 
giving drill. Such devices should facili- 
tate experimentation at the college level. 
Certain of them seem (as will be pointed 
out shortly) peculiarly suited for follow- 
ing out certain of the suggestions referred 
to above for the improvement of college 
work. 
vestigation reported in this paper was to 
see what might be accomplished, using cer- 
tain of these testing and drill machines, 
to follow up some of these suggestions for 
the improvement of instructional efficiency, 


Materials and Subjects 





Previous investigations of college in- 
structional problems had (the writer felt) 
emphasized the following relevant points: 
(1) the motivating effect upon the learner 
of knowledge of standing and progress; 

(2) the value, both for motivation and for 
guidance in learning, of informing students 
specifically and immediately of their er- 
rors and their successes in their work; 

(3) the value in the above (and other) con- 
nections of the frequent short test, as 
contrasted with less frequent longer tests 


| error pattern and difficulty; 


or examinations; (4) the great importance 
of continuous adjustment to individual dif- 
ferences not only in capacity but also in 
(5) the value, 
in all these connections, of a consistent 
use of the make-up test. 

Two pieces of apparatus available to 


| the writer appeared especially useful in fa- 


cilitating an instructional program develop- 
ing the above points. The first was a de- 
vice for the immediate automatic scoring of 


| objective tests and tabulation of results 


| by item.® 
_ an instructor to score a 30-question 
sheet in less than a second. 


The very practical problem of the in- 


| glance at the item-tabulation dials 


This machine made it possible for 
test 
Immediate ly 
after he had finished a test a student could, 
therefore, know his score. And as soon as 
the last student's paper had been run 
through the machine the instructor could 

and at 


| once discuss with the class those questions 





| false questions). 


on which most mistakes had been made. The 
second or drill machine was a small device, 
one of which was given each student for use 
during the drill period. The student indi- 
cated his answer to each question by press- 
ing one of five keys (one of two, for true- 
If the answer was correct 
the drum of the machine turned up the next 
question; if incorrect the same question re- 
mained before the student, and he was thus 
required to find the correct answer to each 
question before proceeding to the next. 
Meanwhile, a counter dial counted the to- 
tal number of tries made in completing the 
test. The students were shown how to re- 
set the machines, and were expected to go 





1. The paper summarizes a thesis submitted in partial fulfillment of the requirements for the doctor's degree at the 
Ohio State University. The writer wishes to express his obligations to Dr. S. L. Pressey for his assistance in 


this investigation. 


2. For a description see S. L. Pressey, "A Third and Fourth Contribution toward the Comirg ‘Industriel Revolution’ in 
Education", School and Society, XXXVI (November, 1952). 
proved in certain respects, as in the use of hand punches for marking the answer slips, and ir the addition of a 
mechanism which automatically printed each student's score on his paper. 


As used in this investigation the apparatus had been im- 
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through each test until they did so with no 
errors, } 

The investigation was carried on dur- 
ing the winter and spring quarters in the 
5 hour required quarter course in educa- 
tional psychology, and dealt with the regu- 
lar work of the course.* The course was 
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taught in sections averaging 30 students, 
by five different instructors; in all, 14 
sections took part in the experiment. The | 
students were mostly freshmen and sopho- 
mores. 
The course is divided into 14 teaching | 
units; for each, two 30-item true-false 
tests were available. There were also used 
a 140-item pre-test (reliability .80) cover-| 
ing the entire course and given to all sec- 
tions at the beginning of the quarter, a 
100-item selective answer (5 choice) mid- | 
term covering the first half of the course, | 
a similar 100-item test covering the sec- 
ond half, and three broad essay type ques- 
tions sampling from the entire course. In 
working up results the mid-term and final 
objective examinations were combined and 
considered one final test (reliability .92).| 





A calendar was prepared at the begin- 
ning of each quarter. This required that 
the sections keep together, spending an 
average of about three days on each teach- | 
ing unit, and that each test be given on 
the same day. Twelve such tests were 
scheduled, the last chapter of the first | 
half of the course being included in the | 
mid-term, and the last test of the second 
half in the objective final. | 

| 
| 
| 


| 
| 
| 


Each quarter, two sections of the 
course were run as “test machine groups", 
two as "drill machine groups", and the re- 
maining three sections as control groups; 
there were thus in all four test sections, 
four drill sections, and six control sec- 
tions. So far as possible, sections of 


each type were rotated among the instruc- 


| material. 
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tors and times of day so that differences 
in instructor efficiency, time of day, etc., 
would be equalized; and careful examination 
of the data would indicate that such fac- 
tors were largely negligible or thus equal- 
ized. The pre-test, and the University In- 
telligence Test (given to all entering stu- 
dents), made it possible to pair cases as 

to ability and preparation. 

All sections took pre-test, mid-term, 
and finals. In the test machine sections 
each one of the twelve unit tests was 
scored by the scoring and tabulating ma- 
chine; as each student finished punching 
his answer slip he brought it to the instruc- 
tor, who at once scored it, checked wrong 
answers (very easily done when the slip is 


| in the machine),® tabulated the score on a 


tabulation form already on the board, and 
returned the slip to the student. When all 
had finished, the test was discussed, the 
instructor directing the discussion toward 
the items which the item-tabulator revealed 
as most frequently missed. Each student 
thus at once knew his score. Since all the 
tests were 30 questions long he readily 
sensed how far he was from mastery of the 
From the distribution on the 
board he could see what his relative stand- 
ing was; he could also see his letter-grade 


| mark, since these were also indicated on 
| the distribution. 


Further, all students 


| who on the first unit test scored below B 


were required to take a make-up test (the 
duplicate form of the first test) on the 
following day; and this was scored and dis- 
cussed in similar fashion. The final mark 
on the unit was then taken as the average of 
the first and make-up tests. The ways in 
which this procedure provided motivation by 
immediate knowledge of results and progress, 
informed of errors, and adjusted to indi- 
vidual difference, will be evident. 

In the drill sections the same general 
procedure was used. But the tests were 
taken using the drill machines. The score 
the first time through was tabulated as 





l, For a description of an early form of this machine see S. L. Pressey, "A Simple Apparatus which Gives Tests and 
Scores—and Teaches", School and Society, XXIII (March 20, 1926). 


2. Various preliminary investigations were made, using nonsense syllables, German vocabulary, and unfamiliar Tartar 





history. 
ful. 


They suggested progressively greater value of the procedures used according as the material was meaning- 
Because of this, and because of the greater practical significance of experimenting with reguler subject mat- 


ter, the major effort of the experiment was therefore directed as above described. 


5. A scheme is now being worked upon, for automatically marking rights, where a mistake has been made. 
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the student's score on that test; but the | the upper part of the distributions is the 
student then reset the machine and repeated | most striking feature of the results. Thus 
the test until he could go through it with- (it will be noticed that on the objective fi- 
out error. The drill sections thus had the nal test Q, of the drill group is 19 points 
advantages of the test section, plus re- above Q, of the control (7 points above the 
view with automatically required correc- median of the control), whereas the experi- 
tions of error, and opportunity for im mental Qs; is only 5 points above the corre- 
mediate progress to mastery of that test. sponding control. On the objective test, 

The control sections took the chapter (the drill group shows greater gains than the 
unit tests by marking on an answer slip test group. On the essay type final this 
which was graded by the instructor that | difference also appears, but all differences 
evening and returned the next day. There |are less. This is probably due in part to 
were no make-up tests. The control sec- the unreliability of the grading of essay 
tions thus did not have immediate knowledge | type questions. At the bottom of the table 
of results (they did know results the next | are show the differences between experi- 


day, and the cards had errors checked so mental and control medians, the probable 
that they could locate mistakes). They errors of these differences, and the differ- 
did not have the stimulus of possible ex- ences divided by the probable errors. It 
cuse from the make-up if they did well on should be again emphasized, however, that 


the first test, the motivation involved in | the most important differences are at Q@ 
the possibility of improving their mark if | rather than at the medians, and that the re- 
they did poorly in the first test, or the sults, and their reliabilities, are to be 
help which taking the make-up and locating evaluated from an inspection of the entire 
any errors might give them. Nor did they distributions rather than from any one com- 
have the benefit of the diagnostic reviews | parison. 

with measure of progress provided by the 

drill machine. The control sections had 

this possible advantage, that the extra Minor Exploratory Investigations 

time taken by the testing and the make-ups, 

in the experimental sections, could in the 

control sections be devoted to teaching and | 





class discussion. Certain special studies were made in an 
| attempt to estimate roughly but more specif- 
Results | ically the effect of some of the motivation- 
/al and instructional techniques. During the 
In handling results, each student in | spring quarter, one chapter test was re- 
an experimental section was paired with a peated in all sections just before the make- 


student in a control section on the double up to see the effect of immediate discussion 
basis of score on the pre-test and on the and drill in the experimental group on these 
intelligence test; pairing was fairly close | same test items. The experimental groups 
(within five points) and in consequence the | gained one point only in the median but 
following data do not of course include all | there were considerably fewer very low 


cases. The table on the following page scores. 

shows the combined (winter and spring A test compounded from the most diffi- 
quarter) results for the "test machine" cult items of tests occurring early in the 
group and its control and the "drill ma- course, and another similarly compounded 


chine " group and its control, on both the from the later tests, were given to see 
200-item objective final tests and the 45- | whether the experimental groups had especial- 
minute, 3 question essay final examination. | ly benefited, as indicated by improvement 
Not only the medians, but also Q,, Q;, and on these questions. Results were not en- 
the entire distributions are given since tirely consister:t, but gains were greatest 
the differential effect of the experimental | in the experimental groups. 

procedures upon the lower as compared with Study was made of the effect of the 
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TABLE I 


PAIRED CASE RESULTS, SPRING AND WINTER QUARTERS COMBINED 


End of quarter differences between (1) test machine group and its control, and (2) drill machine 
group and its control, cases being paired on the basis of (a) intelligence and (b) a pre-test on 


the subject matter of the course. 








200-item multiple choice final test 














A 45-minute, 3 question essay type final 





























examination 
‘ietiscetsctucseaaaasis amie sdeanicl ini - Perna: Pes 
_ TEST Ss we | __—TEST DRILL ; 
Exp.| Con. Exp. Con. Exp. | Con. Exp. | Con. 
185-9 1 1 60-2 
180-4 | 1 1 2 1 57-9 1 
175-9 | 1 2 2 1 54-6 4 1 1 
170-4 | 8 7 4 3 51-3 1 2 2 1 
165-9 | 8 6 6 4 48-0 5 7 4 
160-4 lk; 3 %%. 5 45-7 5 4 5 4 
155-9 9+— 6—Q5 9--+— 8—-Q3, 42-4 13« 3 1l« 3 
150-4 10*; 5 8G 5 39-1 7--++-11—Q3 7: 6 
145-9 10 | 13 10: | 1 36-8 10 ll 7¢+— 8—Qs 
140-4 12-++—- 8—-M 6<- | 4 33-5 10< yg a 12 
135-9 6<; 7 34+ 4—M 30-2 11-—-+—- 9—-M 14<+— 9—M 
130-4 6 : ll 3 i] 8 27-9 14< 10 a ; 5 
125-9 9++—9—Q) 2 i] 8 24-6 5-—+-11—Q, 4—+—- 5-Q) 
120-4 4 5 3—1-6—Q, 21-3 4 7 2 5 
115-9 1 4 3 3 18-0 6 5 3 3 
110-4 1 5 1 3 15-7 7 1 7 
105-9 | 4 4 12-4 1 2 3 
100-4 3 2 9-11 1 1 
95-9 1 6-8 3 
90-4 3-5 
85-9 1 0-2 
N= 99 99 diff 71 71 diff N = 99 99 diff 71 71 diff 
Qs= 163 156 7 #163 158 5 Qs= 43 40 3 44 38 «COG 
M = 150 141 9 153 136 17 M = 35 32 3 37 32. C«* 
Q,= 138 127 11 148 124 19 Q= 28 25 2 30 24 6 
ME - MC 9 17 3 5 
PE of Diff 1.92 2.34 1.06 1.17 
Diff/PE 4.69 7.26 2.83 4.28 





(ME = median of experimental group; MC = median of control group.) 


three trials by drill machine; it might be 
supposed that students having once been 
apprised of their specific errors, and hav- 
ing access to the textbook for reference, 
would on a second or third trial be able 
to answer a test with a perfect score. 
However, median scores of a class on three 
trials on a representative chapter test 
were 23, 27, and 29. The range of scores 
progressively narrowed and the scores were 
bunched at the top on the third trial; but 
the slowness with which this improvement 
took place, in view of the opportunities 
at hand, was rather striking. Students 
missed items on the second and third tri- 
als which they answered correctly on the 





first. The results are an interesting sug- 
gestion as to the probable slow and "de- 
velopmental" character of learning. 

An experimental section and a control 
section, both taught by the same instructor 
and roughly equal in intelligence and 
previous knowledge of the course, were com- 
pared in terms of the grades received on 
the first test given on each chapter. Pre- 
sumably, neither of the experimental groups 
would have the advantage of the special ex- 
perimental procedure of reviews and di- 
agnosis in such a comparison, but more gen- 
eral motivating effects of the total pro- 
cedure (plus cumulative gains in prepara- 
tion for each succeeding unit of werk) 
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might appear. It was found that approxi- 
mately twice as great a percentage of the 
students in the control group received D's 
and E's as in the experimental group, and 
fewer made A's and B's. The same general 
results obtained in comparisons of other 
such classes. 

Questionnaires designed to secure stu- 
dent criticism and comment, upon the pro- 
cedures used, were given in all sections. 
In general, the students reported that they 
preferred to take tests by the methods used 
rather than by the usual methods; they con- 
sidered make-up tests desirable and indi- 
cated that as a general practice they stud- 
{ed chapters further in preparation for 
them. They voted more than 4 to 1 that 
there was not too much testing, and men- 


tioned frequent tests as one of the most de- 


sirable features of the procedure. Only 
one student reported that he had ever taken 
a course before in which his test results 
were regularly made known to him and dis- 
cussed on the same day as the tests were 
given. They reported their marks to aver- 
age about the same, if not a little poorer, 
in the course than in other five hour 
courses, although they thought the course 
was somewhat easier. Cheating was re- 
ported, but few reported that it was more 
than is usual in college classes. They 
voted 9 to 1 that their total reaction to 
the procedure was quite favorable. The most 
undesirable features of the procedure, in 
the student's estimation, were tricky, am- 
biguous test questions, and too high stand- 
ards for grades, particularly for A's and 
B's. Control sections complained that they 
should have had make-up tests too. A few 
in the drill machine group were discon- 
certed by machines which operated faultily, 
and a very few were annoyed by the noise 

of the machines. Some thought that there 
should have been more essay tests, and some 
asked for more time for class discussion. 


The Larger Significance of the Study 





The investigation here reported was 
planned as a very practical class experi- 
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ment. A variety of factors which might 
presumably contribute to increased effi- 
ciency were therefore included; and analysis 
of the contribution of eaci is hardly pos- 
sible. However, two general conclusions of 
very considerable importance may, the writ- 
er feels, be drawn. In the first place, 
the experiment is (so far as the writer 
knows) the first instance of the systematic 
use of machines as part of the regular 
routine in regular university classes. It 
demonstrates (he feels) the possibility of 
mechanical aids in education. 

In the second place, the experiment is 
in interesting contrast to present trends in 
efforts to improve college instruction. The 
present trend seems to be toward the expen- 
sive tutorial system, toward still more ex- 
pensive housing plans, and toward somewhat 
transcendent or sentimental emphasis upon 
the importance of "personality" in teach- 
ers. In contrast, one purpose of this ex- 
periment has been to show that an impersonal 
attempt to organize procedures in college 
instruction for greater efficiency may pro- 
duce demonstrable results and at the same 
time save labor. In this time of depression 
such a saving may be of some interest. 


Conclusions 


(1) Students immediately apprised of their 
test results, and given opportunity to 
correct deficiencies by make-up tests, 
profit markedly in terms of final exam- 
ination results over students who do rot 
have such advantage. 

(2) Students immediately apprised of the 
correctness or incorrectness of their 
responses to each item of a test, and 
given opportunity to correct deficiencies 

’ by drill and by make-up tests, likewise 
so profit. 

(3) The greatest benefit accrues to stu- 
dents who usually score in the lower 
half of the distribution, although the 
entire groups move upward. 

(4) Mechanical test-scoring and drill de- 
vices have a practical use in the class- 
room. They are convenient for students, 
time- and labor-saving for teachers, 
and make possible instructional tech- 
niques not otherwise practicable. 
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ELEMENTS OF CHORD-READING IN MUSIC NOTATION 


by 


Otto Ortmann 
Department of Research 
Peabody Conservatory of Music 


The following test was undertaken as 
an initial step in a comprehensive study of 
the problems of music reading. In no way 
is it intended as a measure of music read- 
ing ability. It merely isolates one phase 
of such reading and attempts a diagnosis of 
the principles underlying this phase. 
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The element selected was the vertical 
grouping of notes as this is found in va- 
rious chord-forms. A series of "flash- 
cards" (Fig. 1) was used in classroom pres- 
entation. The size of these cards was de- 
termined by approximating the ratio between 
note-size and eye-distance found under nor- 
mal music-reading conditions. (The normal 
distance of the eye from the note-rack va- 
ries between 20 and 30 inches.) The staff 
lines used were 7/8" apart, the long di- 
ameter of the elliptical note-heads was 1". 
Preliminary testing indicated that for nor- 
mal vision this size was sufficiently large 
to present a clear visual stimulus for 
classroom use. In order to eliminate all 
factors other than the spatial distribution 
of notes themselves, clefs and stems were 
omitted and black note-heads used. Ata 
"ready" signal each card was exposed for 
2/5 seconds under adequate light conditions. 
The pupils, immediately after each presenta- 
tion, wrote on staff paper "what they had 
seen.” After completion of the test, they 
were asked to comment, so far as possible, 
on the methods used and on any particular 
points of ease or difficulty. A tabulation 
of the replies revealed the presence of def- 
inite measures of approach and also of par- 
ticular visual difficulties. 


Number of Notes 





The functioning of visual-span in read- 
ing, or, for that matter, in any visual per- 
ception, is well known. Consequently, its 
operation in the reading of music symbols 
may be expected. The average number of er- 
rors in 27 papers for one-note examples was 
O; for two-note examples, 2; for three-note 
examples, 3.4; for four-note examples, 13.7; 
for five-note examples, 27.5; for six-note 
examples, 42.3. For the pupils tested, all 
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advanced students in msic, reading diffi- 
culty, other things equal, begins with the 
four-note examples. The steady increase of 
4ifficulty with an increase in number of 
notes is shown in Fig. 2. 


aw 
errers 





zs * § 
Figure 2 

The actual number of notes is seldom 
missed. Many of the omissions found for 
certain examples are the result of failure 
to write a note when in doubt as to its ex- 
act staff-location and not from failure to 
read correctly the total number of notes 
present in the stimulus. "Sometimes I 
didn't put down any note because I wasn't 
sure just what note it was", indicates this 
reaction. This was noted by several stu- 
dents. 

Six-note groups were found difficult 
(with the given exposure time) for even ex- 
perienced sight readers of decidedly supe- 
rior ability. Since, in a moderate Allegro 
tempo, more than six notes are very often 
correctly read in 2/5 seconds, it follows 
that some other form of grouping than the 
note-reading must function. Indications of 


+h 4 


is are given in later paragraphs. 
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Vertical Distance 





A second determinant of eye-span is 
e spatial area. Since, in the present 
only the vertical dimension was used, 
area determinant can be measured by 
vertical distance. If we array the test 
data on this basis, in a gross way, we get 
the following distribution for average er- 


rors: 
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| outlines, such as: 


51 


The presence of other factors, yet to 
| be considered, obscures somewhat the corre- 
lation between vertic distance and diffi- 
| culty of reading, yet the given table indi- 
| cates a clearly marked general increase. If 
| we select examples, such as Models 10, 13, 
| 18 and similar cords, the correlation 
| between vertical distance and number of read- 
ing errors is marked. Using 4-note exam- 
| ples: 
Vertical distance .. 4th Sth 6th 
Average errors ..... O 8 9 


7th 
13 


Vertical distance .. 
Average errors ..... 


10th 13th 
22 28 


16th 
42 


Again, if we take similar chord-forms 
| used in both close-position (small vertical 
distance) and open-position (wide vertical 


| distance) the greater number of errors is 


| always found for the greater distance. Thus 
| errors 


for M. 38 were three times as 
quent as those for M. 19. 

By restricting the count to specific 
intervals of various models having similar 
Models 37, 42, 11, the 
errors are respectively: O, 8, 14; Ms. &, 
10, 18 with errors 5, 12, 19. Both series 
indicate an increase of error with an 
crease of vertical distance, although ‘ 
factor of chord-unit, discussed later, a1: 


fre- 


in- 


_ affects this distribution. 


Additional proof of the distance di! 
ficulty is given by the introspections of 
the pupils, fifteen percent of whom wrote: 
"wide intervals are harder than close ones", 


| volunteering the information without any 


leading question from the examiner. 
Expansion and contraction. The verti- 
cal distance of the visual pattern also af- 
fects the frequency with which the distance 
is overestimated (expansion) and under- 
estimated (contraction). Measured by the 
extreme notes (total vertical distance) 
this was overestimated in 30% and underes- 
timated in 70% of the total number of 








Vertical distance of a fifth or less 


between Sth and octave 
more than octave 


" " 


2-note 3-note 
0 1.9 
0 1.0 
3.6 


4-note 5=-note 
4.5 
10.5 


6-note 


30. 
38.5 


27.5 
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examples. In the 4-note examples embracing 
more than an octave the errors were: exp. 27%, 
contr. 73%; for those less than an octave 
44% and 56%. This indicates a general 
tendency to underestimate the vertical dis- 
tance, the tendency increasing with an in- 
crease in vertical distance. In examples 
such as Ms. 7 and 27 it is very pronounced, 
the former was contracted in 89% of cases, 
the latter in 94%. 

The effect of vertical distance on the 
overestimating or underestimating of pat- 
terns, however, is also somewhat modified 
by the particular note distribution. In- 
tervals, such as sevenths may be written as 
octaves because of the familiarity with the 
harmonic significance of the latter. An 
open position, such as M. 38 may be recog- 
nized as a triad and thus written correctly 
in spite of the wide vertical distance. 


Form of Visual Pattern 





Notes written in standard staff nota- 
tion necessarily take on various patterns 
as a result of the spatial distribution of 
the notes. Such patterns are illustrated 
by the models given in Fig. 1. Examples 
like M. 2, 5, 9, 10, etc., not only have a 
vertical dimension but a horizontal dimen- 
sion and, consequently, they best serve to 
illustrate the extent to which pattern-out- 
line is observed in reading notes. The in- 
structions of the test did not mention pat- 
tern, the pupils were free to write the 
notes in any gpatial relationship desired. 
A number of combinations for such examples 
as 9 or 17 are possible and all would be 
correct from a notation standpoint. 

For the single vertical lines (such as 
M. 1, 3, 7, etc.) the form was retained in 
100% of replies; for the mixed groups 
(oblique relationship or two verticals) the 
correct form was retained in 90% of replies. 

More significant than this, as indi- 
cating the importance of visual pattern in 


note-reading, is the fact that the geometric 


outline was retained even when reading er- 
rors occurred. The following illustrations 
are typical; the large notes are the models, 
the small notes are the written versions: 
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Figure 3 


They show, in some cases, complete re 
tention of the pattern outline; in others, 


retention of the vertical and oblique as- 
pects with errors in the details. The visual 
pattern is correctly reproduced but the re- 


lationship between individual notes and the 
staff is incorrect. 

If this be true, then the simpler pat- 
terns should be easier to read. Simplicity 
or complexity is determined by the linear 
relationships. When the notes of an exam- 
ple appear to form parallel lines in both 
dimensions as in Models 11] and 42, the 
group is more readily perceived than when 
they do not form parallel lines in both di- 
mensions, as in Ms. 10 and 13. Several ex- 
amples permit a comparison on this basis. 
For the 4-note examples: 


Parallel structure Ms. 11, 37, 42; 
average error 7.3 

Mixed structure Ms. 10, 18, 34; 
average error 10.3 


M.9, on account of the number of tones 
present, was a difficult example with many 
errors, yet the two verticals and three as- 
cending obliques (left to right) were 
changed in only 3.7% of cases. 

Further evidence of the significance of 
the visual pattern is found in the intro- 
spections of the pupils: "I saw the notes 
as a design or pattern." "Names of notes 
not thought of, worked on pictures of the 
groups presented.” “Shape of chord-forms 
was helpful." "Some formed single vertical 
line, others two diagonals, still others 
had one note on a second vertical line." 

Since all visual stimuli are grouped 
on the basis of proximity in space, the 
patterns used will necessarily show 
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subdivision into smaller groups according 
to the manner in which the notes are dis- 
tributed. Examples, such as Fig. 4a, are 


not subdivided on the basis of spatial re- 
lationship; the remaining examples of 

Fic. 4 are so subdivided, the grouping in- 
to two units increasing from b to f. 


This 


spatial grouping, which is a basic visual 
reaction, is interfered with by the verti- 
cal reading (two fourths or two fifths, 
etc.) resulting from the purely vertical 
construction of all chord-forms in music. 
Roth types of reading were used by the pu- 
pils. Vertical type: "Noted whether in- 
tervals were seconds, thirds, etc. Also 
watched for octaves." "Even in No, ll, I 
read two perfect fifths." Spatial type: 
"Some chords, such as No, 1l, divided them- 
selves into two close groups, seconds.” "In 

. 11,1 divided it C going to D, and F to 
ag 

In examples of only one vertical line, 
notes will also be grouped through spatial 
proximity. Fig. 5 shows the distribution 
of errors for the given examples: large, 
black notes are the stimuli, small, black 
notes the incorrectly written notes; small, 


white notes are the omissions. 
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The eye is at first attracted to the 
portion of the pattern with closest notes; 
at a this is the upper third, at b it is 
the third in the middle. As a result, there 
are fewer errors for these parts than for 
the remaining parts. This is especially 
noticeable in b. In M. 14 two pupils 
wrote: "The top two notes grouped them- 
selves together." Another general intro- 
spection was: "First saw centre of group 
where most notes were together.” (Example 
14 is an instance in which the ascending 
reading is interfered with by the high lo- 
cation of the visual centre of gravity.) 
For the same reason, we find, as typical 
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variants of the following oxample, such 
distributions as in Fig. 6. 
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Figure 6 

The “togetherness” of notes applies 
chiefly to seconds and thirds, since, in 
both cases, the notes are adjacent, hence 
without open space between. With the in- 
terval of a fourth, the two notes compris- 
ing the interval are separated by a white 
space, hence the grouping is less strong. 
The degree to which it then functions is a 
relative matter, depending upon the dis- 
tribution of the other notes of the chord. 
In M. 7 the upper two notes are grouped be- 
cause the interval of a fourth is the small- 
est interval in that example. In M,. 18 the 
two lower notes (also a fourth apart) are 
not grouped because the remaining intervals 
are smaller. Grouping, therefore, varies 
with each chord. 

This variation does not hold for the 
more fundamental grouping of seconds and 
thirds into “close” and all other intervals 
into "open", which remains the basic di- 
vision for notation. 

However, if an open fifth is "filled 
in" with an additional third, the three 
notes become a unit. Compare, in this re- 
spect, M. 39 with M. 19. For a similar 
reason, the interval of a seventh, open in 
M. 14, may become "close" by filling in 
two thirds, as in M. 16. With such addi- 
tions a visual centre appears, whereas in 
the two-note open intervals there is no 
visual centre; they remain visually un- 
grouped. When the distance is "filled in" 
the visual centre is along the middle of 
the chord. 

The ease, therefore, with which ver- 
tical notes are grouped, depends upon the 
spatial distribution. 

Number of Verticals. In their remarks 
several pupils noted that: "Examples with 
more than one vertical line are difficult." 
This difficulty occurs when the verticals 
do not agree with familiar chord-forms. 

M. 9, read vertically, is difficult on this 
account; read obliquely, as three seconds, 
it is considerably easier. In such cases 
the model is frequently changed into fam 
iliar chord-units as in Fig. 7 written for 
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Figure 7 
M. 9. And, conversely, when the verticals 
are familiar chord-forms, the vertical 
reading is advantageous. 

In other examples the outline suggests 
definite grouping. In M. 20 the grouping 
is three thirds. Yet if this example be 
written as in Fig. 8, any version of which 
is correct notation, the division into 
three equal groups is entirely lost, anda 
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Figure 8 
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complex grouping results. Thus the diffi- 
culty of the example is increased. 

It follows that when chord-groups are 
printed they should contain as many repeated 
elements as possible (Fig. 12a on page 56) 
which are somewhat easier ihan the forms 
shown in Fig. 12b on page 56. Sometimes, 
an even arrangement is impracticable, but 
within certain limits, it can be done. 

The test results show a general tend- 
ency to read from the bottom note up. In 
Fig. 9 the error and omission distributions 
are given for various examples, including a 
few widely dispersed examples. These are 
selected because the wide vertical dis- 
tances will reveal more clearly the direc- 
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tion of reading. If this be from the bot- 
tom note up, the greatest number of errors, 
including notes and especially omitted 
notes, will be found among the upper notes. 
(See Mo. 7, 20 and 36.) 

To the question "Did you read all the 
notes at once, or did you read from the top 
down or from the bottom up", 96% replied, 
unequivocally "from the bottom up", modify- 
ing it with the remark that for close 
groups, such as Nos. 10, 42, etc., the di- 
rection of reading was not noticeable. 

This preference for ascending reading 
may readily result from associations with 
the construction of all chord-forms in 
music, which is invariably in an ascending 
direction. Since all the pupils taking the 
test were thoroughly experienced in the 
writing of chord-forms, the association, 
naturally, was strongly developed. On the 
other hand, it does not follow that the as- 
cending reading, per se, has any advantage 
over a descending reading. The obvious ad- 
vantage accrues from the readiness with 
which it fits into other notation experi- 
ences. In the reading of Ex. 9, for in- 
stance, the reaction "two ascending ad- 
jacent pairs of fourths on E - F", would 
involve concepts very familiar to the stu- 
dent and would include all the relationships 
among the notes of this example. Whereas, 
the reaction: "Six-notes, the bottom one 
on bottom line, next one on bottom space, 
next one on second space, next one on third 
line, etc.", obviously is so complex as to 


| preclude its functioning effectively in 





the short exposure time. It is, therefore, 
natural to employ the interval and notation 
concepts already established, and since 
these are all ascending in nature, they ex- 
plain the ascending reading. In view of 
the number and strength of these associa- 
tions, it is probably wise to encourage as- 
cending reading independent of the particu- 
lar clef used. 

However, as mentioned under Visual 
Centre, a close grouping of notes at any 
level, fixes the eye at this level. In 
Ex. 18 the note most often incorrectly read 
is the lowest. The omissions still point 
to an ascending reading. The direction, 


therefore, seems to remain, but the visual 
centre focuses the eye and draws it away 
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from the lowest note before this has been 
adequately read. When it is then written, 
guess substitutes the octave of the highest 
note, a combination found more frequently 
than the given example. This is borne out 
by the introspections: "watched for oc- 
taves", "it hurt to write intervals such as 
ninths, and I always wondered whether it was 
an octave instead." 


Cc ts 


In some ways the most important element 
in chord reading is the higher-unit group- 
ing on the basis of chord-construction. That 
is to say, the recognition of a note-group 
as a single chord-unit, regardless of the 
actual number of notes involved. Chords 
are constructed in thirds. The smallest 
form is the triad, consisting of two thirds, 
one immediately above the other. Since 
thirds are “close” intervals in notation 
(notes touching each other) patterns such as 
Models 19, 21, 25, 30 are units both from 
harmonic construction and from visual group- 
ing. In such cases, the theoretical and 
the visual reactions coincide, with the re- 
sult that such examples are the easiest to 
read. Not a single error in reading "tri- 
ads" occurred in the test. The ease of 
reading is independent of the exact posi- 
tion of the group on the staff. 

The next larger chord-unit is the 
seventh-chord, which consists of three as- 
cending thirds (Ms. 16, 23). Here again 
the theoretical and visual reactions coin- 
cide, and the frequency of reading error is 
again low: 3%, 

A slightly less familiar chord-unit 
(first inversion of a triad) is given in Ms. 
1, 41. When these examples are compared 
with other 3-note examples which are not 
chord-forms, such as Ms. 2, 12, 15, the ra- 
tio of reading errors was found to be as 3% 
to 15%. When the 4-note chord-forms with 
their 3% reading error are compared with 
M. 18, which has the same vertical distance 
and the same number of tones,(but is not a 
chord-unit) with its 60% error, the effect 
of chord-unit upon ease of note-reading is 
clearly revealed. It is the musical paral- 
lel of word reading as against syllable or 
letter reading. 
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The use of this higher-unit device is 
reflected in the introspections: "Read the 
notes as chords; hard when this was not pos- 
sible." “Saw these with regular forms as 
awhole: V7, II7, etc." "Found triads and 
their inversions and seventh chords easier 
than non-chord forms." “Read as chords 
whenever possible." "In No. 14 saw upper 
three notes as incomplete 7th chord." "G-B-D 
was a triad." 

When the chord-unit and the visual 
unit do not coincide, as in Ms. 13, 34, the 
reading is of moderate difficulty (38% er- 
ror). In these examples the visual unit is 
clearly marked, but it does not suggest a 
chord-unit. 

When neither visual-unit nor chord- 
unit is present, as in Ms. 4, 7, 14, the 
difficulty in reading increases further 
(74% error). Examples with wide vertical 
dispersion and non-chord units, form the 
most difficult examples. The difficulty be- 
gins to appear even with 3-note examples. 

Attempts to substitute chord-units for 
a mixed visual pattern are indicated ina 
few reactions to M.36 which was grouped 
F-B-D and G-E-G in spite of the 4-2 verti- 
cal grouping of the pattern. The tendency 
to think in familiar chord-units is re- 
flected also in the type of errors mde. 
Thus, Ms. 9 and 20 were typically written 
as at a, d, g, h Fig. 3, showing the fam- 
iliar octave substitution for the sevenths. 

A typical reaction to Model 10 was to 
write the upper two notes a step lower. This 
may result from the general tendency to 
"contract" wide intervals, but may also re- 
sult from the sbustitution of the familiar 
triad reaction, which then forces the note 
on the right into a wrong position in order 
to retain the pattern outline. 

When, in a preliminary test, the notes 
C-F-B were given, the following error dis- 
tribution occurred, showing clearly the 
substitution of a familiar note (a) for the 
less familiar (b) (Fig. 10). 
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Space and Line Relationship 





The fundamental division of notes on 
the staff is into notes on lines and notes 
in spaces. When the errors are tabulated 
on this basis we get the following ratio: 
lines written as spaces 81 times; spaces 
written as lines 106. For the same exan- 
ples, the frequency of lines to spaces is 
as 28 to 36. Assuming that the number of 
errors is proportional to the frequency of 
lines and spaces we have the following pro- 
portion 81 : 106 :: 28 : 36 or 2916 = 2948 
which is close enough to indicate an equal 
distribution of the error. In other words, 
lines are read as spaces about as often as 
spaces are read as lines. 

There are two exceptions to this rule. 
First, when the example contains only 
spaces, the error of reading a space for 
the adjacent space exceeds the error of 
reading the space for the adjacent lines. 
In other words, the entire pattern is moved 
up or down to the next space. Second, when 
only leger lines are used (as in M, 27, the 
errors of reading line for adjacent line and 
space for adjacent space exceed the errors 
line-space or space-line. (See Fig. 9, 

M. 27.) 

Taken collectively, however, the error 
of one staff degree (line-space, space-line) 
exceeds the error of two staff degrees 
(line-line or space-space). The most fre- 
quent error made is the misreading of a 
note by one step. 

When symmetrical points of a pattern 
have the same line and space relationships 
the example is easier than when these rela- 
tionships are absent. Thus Fig. lla is 
easier than Fig. 11b. 
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This, in turn, makes thirds, fifths 
and sevenths somewhat easier than fourths 
and sixths, when allowance is made for dif- 
ferences in vertical distance. And, con- 
versely, when lines and spaces are present 
in mixed order, as in M. 9, the mixplacement 
of lines and spaces becomes a typical error. 
In one instance the entire relationship was 
reversed (see Fig. 3a). 
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In this example, as well as in Ex. 17, 
many partial reversals of lines and spaces 
occurred, since the stimuli themselves pre- 
sent a mixed outline. 

A slight tendency to substitute one 
space for another (instead of an adjacent 
line) is shown in Fig. Sa and Fig. 9, M. 35, 
In both examptes a single note is rather 
widely separated from its neighbors. Conse- 
quently, its space-line position is more 
readily seen, but the exact position of the 
space or line is no clearer than when the 
notes are grouped.- Hence, the error is one 
of number of space or line rather than one 
of space-line or line-space. The converse 
of this is given by the error distribution 
for M. 42. Here, where the notes are 
closely grouped the space-space or line-line 
error is entirely absent. Similar instances 
are Ms. 9, 17, 18, 36. The fact that the 
most frequent error is that of one staff 
degree indicates that the spatial dimensions 
(vertical distance) is noted more readily 
than the line-space position of a single 
note. This is the typically increment er- 
ror common to all sensory discriminations: 
small increments are more readily misread 
than larger increments. , 


Mental Set 


If a chord-unit be anticipated and the 
stimulus by chance coincides with the an- 
ticipated unit, the example becomes easy. 
Thus M, 20 can be divided into a 4/3 chord 
on the lowest note, plus the top third, or 
as a 4/3 on (if treble clef, a) with the 
lower third. The errors for the two types 
of reading will differ since the visual 
focus will differ. It is a phenomenon sim- 
ilar to the staircase illusion and others, 
in which the mental set varies the grouping 
of the visual stimuli. In actual music 
reading this mental setting plays an ex- 
ceedingly important role. The data secured 
in this test merely indicate its presence. 
One pupil wrote: "I find myself expecting 
certain chords; when I'm right the reading 
of even complex patterns becomes easy.” 

The mental set need not be complete be- 
fore the stimulus is presented. If, in as- 
cending reading, the two lower notes suggest 
a familiar chord-unit, the mental set is 
established and the remaining notes are 
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typically forced into the expected chord- 
form. This has been experimentally shown 
in some of the error distributions of Fig.a 


Note-Names 


Although, in the test as given, no 
clef was used, hence the notes could have 
no names but were merely positions on the 
staff, yet in practically all cases names 
were assigned. In 70% of the papers the 
treble clef was definitely mentioned; an 
additional 15% indicated by chord-forms 
(v7, II7, etc.) that they were reading in 
the treble clef. The remaining 15% made no 
mention of what, if any, clef was being 
used. One paper read: "Names of notes 
seldom thought of, rather worked on pic- 
tures of the groups presented." Another 
read: "In all simpler forms I heard the 


sound of the notes"; this means that al- 
though note names may not have been present 
to consciousness, the pupil had at least 
assigned relatively definite pitches. 

It does not follow, however, in reading 
complex notation stimuli and immediately 
reproducing them on a keyboard, that indi- 


vidual notes are read. In fact, the tend- 
ency to read in higher-units, described un- 
der Chord-Units, shows clearly a reduction 
of the separate notes into a note-whole. 
But the data show that note-names are more 
concrete than memory in terms merely of 
line and space relationships. This condi- 
tion pertains, of course, to any group of 
music students; non-music students would 
necessarily have to rely upon the geometric 
pattern, the line-space distribution. 
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Sight Reading 





The data of this test cannot be used 
as indices for what ‘s generally called 
sight-reading ability. This includes repro- 
duction, either vocally, or on some other 
instrument, of the visual stimlus. Since 
the test did not involve such reproduction, 
it cannot give a complete sight-reading in- 
dex. The ability quickly and correctly to 
see the visual stimulus does not include 
the abilities to reproduce it. (However, 
the ability to reproduce it does include the 
ability to read it quickly and correctly.) 
The pupil making the second highest score in 
the notation test (95%) is conspicuously de- 
fective in reproducing visual stimuli on 
the piano, in spite of an adequate technical 
(kinesthetic) equipment. The pupil rating 
third highest (94%) is below average in 
sight reading. The pupils rating first, 
fourth and fifth are excellent sight read- 
ers. Other pupils making fairly high scores 
(between 80 and 90) vary in sight-reading 
ability from good to very poor. 

Such a distribution shows that when 
visual stimuli are presented in their ver- 
tical aspect alone they are sufficiently 
dissociated from the visual stimuli as used 
in actual musical compositions to include 
only one element of the sight-reading proc- 
ess. And it shows, conversely, that sight 
reading is a complex process which does not 
depend upon the perception of single note- 
elements. 

On the other hand, the principles un- 
derlying the reading of the stimuli as used 
in this test probably function in the read- 
ing of complex stimuli as well. 





